- neuml/paperai "indexes databases previously built with paperetl" and does RAG with txtai; https://github.com/neuml/paperai :
- > paperai is a combination of a txtai embeddings index and a SQLite database with the articles. Each article is parsed into sentences and stored in SQLite along with the article metadata. Embeddings are built over the full corpus.
- paperai has a YAML report definition schema that's probably useful for meta-analyses.
- paperetl can store articles in SQLite, Elasticsearch, JSON, or YAML. Markdown from a tagged git repo does not appear to be supported yet. Supported inputs include PDF, XML (arXiv, PubMed, TEI), and CSV.
- PaperQA2 has a CLI: https://github.com/Future-House/paper-qa#what-is-paperqa2 :
- > PaperQA2 is engineered to be the best agentic RAG model for working with scientific papers.
- > [...] Semantic Scholar, CrossRef, [...]
- paperqa-zotero: https://github.com/lejacobroy/paperqa-zotero
- The Oracle of Zotero is a fork of paper-qa with FAISS and langchain: https://github.com/Frost-group/The-Oracle-of-Zotero
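
To make the paperai description above concrete (articles parsed into sentences, stored in SQLite with article metadata, with a similarity index over the whole corpus), here is a minimal sketch using only the Python standard library. It is not paperai's or paperetl's actual schema or API: the table layout, the `ARTICLES` sample data, and the function names are all hypothetical, and a toy bag-of-words cosine score stands in for the txtai embeddings index.

```python
import math
import re
import sqlite3
from collections import Counter

# Hypothetical sample corpus; real input would come from paperetl.
ARTICLES = [
    {"id": "a1", "title": "Toy paper A", "source": "example",
     "text": "Masks reduce transmission in hospitals. The study enrolled 200 nurses."},
    {"id": "a2", "title": "Toy paper B", "source": "example",
     "text": "Vaccines were stored at low temperature. Storage failures were rare."},
]

def split_sentences(text):
    """Naive sentence splitter: break after ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def vectorize(text):
    """Bag-of-words term counts (a stand-in for a real embedding vector)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def build_db(articles):
    """Store article metadata and one row per sentence in an in-memory SQLite db."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE articles (id TEXT PRIMARY KEY, title TEXT, source TEXT)")
    db.execute("CREATE TABLE sentences (id INTEGER PRIMARY KEY, "
               "article TEXT REFERENCES articles(id), text TEXT)")
    for a in articles:
        db.execute("INSERT INTO articles VALUES (?, ?, ?)",
                   (a["id"], a["title"], a["source"]))
        for s in split_sentences(a["text"]):
            db.execute("INSERT INTO sentences (article, text) VALUES (?, ?)",
                       (a["id"], s))
    db.commit()
    return db

def search(db, query, limit=3):
    """Score every sentence against the query; return (score, sentence, title)."""
    q = vectorize(query)
    rows = db.execute("SELECT s.text, a.title FROM sentences s "
                      "JOIN articles a ON a.id = s.article").fetchall()
    scored = [(cosine(q, vectorize(text)), text, title) for text, title in rows]
    scored.sort(key=lambda r: r[0], reverse=True)
    return [r for r in scored[:limit] if r[0] > 0]

if __name__ == "__main__":
    db = build_db(ARTICLES)
    for score, sentence, title in search(db, "transmission in hospitals"):
        print(f"{score:.2f}  {title}: {sentence}")
```

Keeping sentences and metadata in SQLite means a hit can be joined back to its article for reporting; swapping `vectorize`/`cosine` for a real embeddings index would change only the `search` step.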