Marvin
MarvinMetadataExtractor #
Bases: BaseExtractor
Source code in llama-index-integrations/extractors/llama-index-extractors-marvin/llama_index/extractors/marvin/base.py
22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 |
|
llm_model_string
class-attribute
instance-attribute
#
llm_model_string: Optional[str] = Field(description='The LLM model string to use for extracting custom metadata')
Metadata extractor for custom metadata using Marvin.
Node-level extractor. Extracts
marvin_metadata
metadata field.
Args:
marvin_model: Marvin model to use for extracting metadata
llm_model_string: (optional) LLM model string to use for extracting metadata
Usage:
#create extractor list
extractors = [
TitleExtractor(nodes=1, llm=llm),
MarvinMetadataExtractor(marvin_model=YourMarvinMetadataModel),
]
#create node parser to parse nodes from document
node_parser = SentenceSplitter(
text_splitter=text_splitter
)
#use node_parser to get nodes from documents
from llama_index.ingestion import run_transformations
nodes = run_transformations(documents, [node_parser] + extractors)
print(nodes)