Lilac
LilacReader
Bases: BaseReader
Lilac dataset reader.
Source code in llama-index-integrations/readers/llama-index-readers-lilac/llama_index/readers/lilac/base.py
load_data
load_data(dataset: str, text_path: Path = 'text', doc_id_path: Optional[Path] = 'doc_id', columns: Optional[List[ColumnId]] = None, filters: Optional[List[FilterLike]] = None, project_dir: Optional[str] = None) -> List[Document]
Load text from a Lilac dataset and return it as a list of Document objects.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
project_dir | Optional[str] | The Lilac project dir to read from. If not defined, uses the | None |
text_path | Path | The path to the text field in the dataset. If not defined, uses 'text'. | 'text' |
columns | Optional[List[ColumnId]] | The columns to load from the dataset. If not defined, loads all columns. | None |
dataset | str | The dataset to load. Should be formatted like {namespace}/{dataset_name}. | required |
filters | Optional[List[FilterLike]] | Filters to apply to the dataset before loading into documents. Useful to filter for labeled data. | None |
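As a rough illustration of the parameters above, the following sketch validates the `{namespace}/{dataset_name}` format before calling `load_data`. The helper names `split_dataset_id` and `load_lilac_documents` are hypothetical, as is the dataset name in the usage note. The import path `llama_index.readers.lilac` is inferred from the source location and assumes the `llama-index-readers-lilac` package is installed.

```python
from typing import List, Optional, Tuple


def split_dataset_id(dataset: str) -> Tuple[str, str]:
    """Split a '{namespace}/{dataset_name}' string into its two parts.

    Hypothetical helper, not part of LilacReader.
    """
    namespace, sep, name = dataset.partition("/")
    # Reject ids with no slash, an empty part, or more than one slash.
    if not sep or not namespace or not name or "/" in name:
        raise ValueError(
            f"Expected '{{namespace}}/{{dataset_name}}', got {dataset!r}"
        )
    return namespace, name


def load_lilac_documents(
    dataset: str,
    text_path: str = "text",
    project_dir: Optional[str] = None,
) -> List:
    """Hypothetical wrapper: check the dataset id, then call LilacReader."""
    split_dataset_id(dataset)  # fail early on a malformed id

    # Imported lazily so split_dataset_id works without the integration
    # installed. Assumed import path; adjust if your install differs.
    from llama_index.readers.lilac import LilacReader

    reader = LilacReader()
    return reader.load_data(
        dataset=dataset,
        text_path=text_path,
        project_dir=project_dir,
    )
```

A call might look like `load_lilac_documents("local/my_dataset", project_dir="./lilac_project")`, where both the dataset name and the project directory are placeholders for values from your own Lilac project.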
Source code in llama-index-integrations/readers/llama-index-readers-lilac/llama_index/readers/lilac/base.py