segram.nlp.pipeline.coref module
Segram coreference pipeline component.
- class segram.nlp.pipeline.coref.Coref(nlp: Language, name: str, model: Language, components: Sequence[str] | None = None)[source]
Bases:
objectCoreference resolution pipeline component based on
spacycoref component.- name
Pipe name.
- model
Language model for coreference resolution.
- __init__(nlp: Language, name: str, model: Language, components: Sequence[str] | None = None) None[source]
Initilization method.
- Parameters:
nlp – Main language model.
model – Name of a coreference language model.
components – Names of pipeline component names to include. Use all if
None.
- Raises:
ValueError – If
componentsare empty but notNone.
- set_corefs(doc: Doc, cluster: Sequence[int]) None[source]
Set proper coreferences from pronoun tokens to closest non-pronoun neighbors within the
cluster.Notes
Coreferences are stored as token indexes (integers) in
_refcustom attribute on tokens.
- classmethod from_model(nlp: Language, name: str, model: str, components: Sequence[str] | None = None, **kwds: Any) Self[source]
Initialize from model name.
**kwdsare passed tospacy.load().