I would like to confirm which information was utilized to construct the embedding_v1 variable, which is stored in the google_patents_research.publications table. Specifically, was the model trained using:
-
The full patent text (specifications)?
-
Images (patent drawings)?
-
Citation data (references)?
Were all three of these information sources used?"