Question 1

What is named entity recognition in simple terms?

Accepted Answer

Named entity recognition is the task of scanning text and labelling the spans that refer to real-world things such as people, organisations, locations, dates, and amounts. Instead of just reading words, the system tags which words form a meaningful entity and what type it is. It turns unstructured prose into structured, queryable data.
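
As a rough illustration of what "structured, queryable data" can look like, here is a minimal Python sketch. The sentence, the `Entity` record, and the `annotate` helper are all made up for this example, and the annotations are written by hand to stand in for what an NER system would produce.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    text: str   # the surface form found in the sentence
    label: str  # the entity type, e.g. PERSON or LOC
    start: int  # character offset, inclusive
    end: int    # character offset, exclusive


def annotate(source: str, text: str, label: str) -> Entity:
    """Build an Entity by locating the first occurrence of `text` in `source`."""
    start = source.index(text)
    return Entity(text, label, start, start + len(text))


sentence = "Ada Lovelace joined Acme Corp in London on 3 May 2021."

# Hand-written annotations standing in for a model's output.
entities = [
    annotate(sentence, "Ada Lovelace", "PERSON"),
    annotate(sentence, "Acme Corp", "ORG"),
    annotate(sentence, "London", "LOC"),
    annotate(sentence, "3 May 2021", "DATE"),
]

# Once the entities are records, the text can be queried like data.
locations = [e.text for e in entities if e.label == "LOC"]
print(locations)  # ['London']
```

Storing character offsets alongside the label keeps each entity tied to its exact position in the original text, which matters for tasks such as highlighting or redaction.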

Question 2

What is BIO tagging?

Accepted Answer

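BIO tagging is the standard scheme for marking entity spans token by token. B marks the beginning of an entity, I marks a token inside the same entity, and O marks a token outside any entity; the B and I tags carry the entity type as a suffix, as in B-LOC. So 'New York City' becomes B-LOC I-LOC I-LOC. Because every entity opens with a B tag, two adjacent entities of the same type stay separate, which lets a model represent multi-word entities without ambiguity.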
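
To make the scheme concrete, here is a small Python sketch that collapses token-level BIO tags back into entities. The function name `bio_to_spans` and the example sentence are invented for illustration, and the handling of a stray I tag (treating it as outside any entity) is one possible convention, not the only one.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collapse token-level BIO tags into (entity_text, entity_type) pairs."""
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def close_entity() -> None:
        # Emit the entity being built, if any, and reset the state.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            spans.append((" ".join(current_tokens), current_type))
        current_tokens = []
        current_type = None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always opens a new entity, closing any open one.
            close_entity()
            current_tokens = [token]
            current_type = tag[2:]
        elif tag.startswith("I-") and current_tokens and tag[2:] == current_type:
            # An I- tag of the same type continues the open entity.
            current_tokens.append(token)
        else:
            # An O tag, or an I- tag with no matching open entity.
            close_entity()

    close_entity()
    return spans


tokens = ["She", "flew", "from", "New", "York", "City", "to", "Paris"]
tags = ["O", "O", "O", "B-LOC", "I-LOC", "I-LOC", "O", "B-LOC"]
print(bio_to_spans(tokens, tags))
# [('New York City', 'LOC'), ('Paris', 'LOC')]
```

The B tag is what keeps neighbouring entities apart: `bio_to_spans(["Paris", "London"], ["B-LOC", "B-LOC"])` yields two separate locations rather than one entity called "Paris London".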

Question 3

Can large language models do NER without training?

Accepted Answer

Yes, in the sense that no task-specific training is required. You can prompt an LLM to extract entities and return them as JSON, and it will often do well on common entity types with no fine-tuning. The trade-offs are cost, latency, and consistency: a small dedicated NER model can be cheaper and more reliable at scale, while an LLM is faster to set up and handles unusual entity types more flexibly.
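
A minimal Python sketch of the prompt-and-parse approach follows. It makes no real API call: the model's reply is simulated with a hard-coded string, and `build_prompt`, `parse_entities`, and the `ALLOWED_TYPES` set are hypothetical names chosen for this example. The validation step is one way to deal with the consistency problem, since a model may return malformed JSON, an unexpected type, or an entity that never appears in the text.

```python
import json
from typing import Dict, List

# Illustrative label set; a real project would choose its own.
ALLOWED_TYPES = {"PERSON", "ORG", "LOC", "DATE", "MONEY"}


def build_prompt(text: str) -> str:
    """Build an extraction prompt that asks for a JSON array only."""
    return (
        "Extract the named entities from the text below. "
        "Respond with only a JSON array of objects, each with the keys "
        '"text" and "type". Allowed types: '
        + ", ".join(sorted(ALLOWED_TYPES))
        + ".\n\nText: "
        + text
    )


def parse_entities(raw_response: str, source_text: str) -> List[Dict[str, str]]:
    """Parse a model reply, keeping only well-formed, verifiable entities."""
    try:
        data = json.loads(raw_response)
    except json.JSONDecodeError:
        return []  # the reply was not valid JSON
    if not isinstance(data, list):
        return []

    entities: List[Dict[str, str]] = []
    for item in data:
        if not isinstance(item, dict):
            continue
        text, etype = item.get("text"), item.get("type")
        if not isinstance(text, str) or not isinstance(etype, str):
            continue
        if etype not in ALLOWED_TYPES:
            continue  # drop labels outside the requested set
        if text not in source_text:
            continue  # drop entities that do not occur in the source
        entities.append({"text": text, "type": etype})
    return entities


source = "Tim Cook leads Apple in Cupertino."
prompt = build_prompt(source)

# Simulated reply; in practice this string would come from the LLM.
simulated_reply = (
    '[{"text": "Tim Cook", "type": "PERSON"}, '
    '{"text": "Apple", "type": "ORG"}, '
    '{"text": "Google", "type": "ORG"}, '
    '{"text": "Cupertino", "type": "CITY"}]'
)
print(parse_entities(simulated_reply, source))
# [{'text': 'Tim Cook', 'type': 'PERSON'}, {'text': 'Apple', 'type': 'ORG'}]
```

Here "Google" is discarded because it does not occur in the source text, and "Cupertino" is discarded because "CITY" is not one of the requested types.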

Question 4

What are NER systems actually used for?

Accepted Answer

NER powers search and indexing, redaction of personal data, resume and invoice parsing, clinical and legal document analysis, knowledge-graph construction, and feeding downstream tasks like relation extraction. Anywhere you need to pull structured facts out of free text, NER is usually the first step.
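
Taking redaction as one example, here is a short Python sketch of how entity spans can drive it. The `redact` function and the sample sentence are invented for illustration, the spans are computed by hand in place of a real NER model, and the sketch assumes the spans do not overlap.

```python
from typing import List, Tuple


def redact(text: str, spans: List[Tuple[int, int, str]]) -> str:
    """Replace each (start, end, label) span with a [LABEL] placeholder.

    Assumes the spans do not overlap.
    """
    # Work from right to left so earlier offsets stay valid
    # after each replacement changes the length of the text.
    for start, end, label in sorted(spans, reverse=True):
        text = text[:start] + f"[{label}]" + text[end:]
    return text


sentence = "Jane Doe visited Berlin."

# Hand-built spans standing in for the output of an NER model.
person_start = sentence.index("Jane Doe")
place_start = sentence.index("Berlin")
spans = [
    (person_start, person_start + len("Jane Doe"), "PERSON"),
    (place_start, place_start + len("Berlin"), "LOC"),
]

print(redact(sentence, spans))  # [PERSON] visited [LOC].
```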

What Is Named Entity Recognition (NER)?

What named entity recognition does

NER as a sequence-labelling task

How NER models evolved

NER in the LLM era

Why NER still matters