Codestin Search App

Paper Assistant: Organize and Explore Research Papers.

Purpose

The main goal here is to just have a better way of organizing papers for readers of Daily Papers. It leverages Neo4j and Google AI Studio's Gemini 2.0 Flash to create a knowledge graph of papers, their concepts, and relationships, allowing you to easily discover connections and insights. It's designed for readers of Daily Papers who want a deeper understanding of the research landscape.

Setup

To get started, you'll need to do the following:

Install Neo4j Desktop: Download and install Neo4j Desktop. This is necessary to create and manage your local graph database.
Create a Neo4j Database:
- Open Neo4j Desktop.
- Create a new DBMS.
- Inside the DBMS, create a new database (e.g., named paper_assistant).
- Start the database.
Set the DATABASE Environment Variable:
- Create a .env file in the root of your project directory.
- Add the DATABASE variable to the .env file. The value should be the name of your database.
Get a Google AI Studio API Key:
- Go to Google AI Studio and create an account (if you don't already have one).
- Create a new API key.
- Add the GEMINI_API_KEY variable to your .env file: GEMINI_API_KEY=YOUR_GEMINI_API_KEY
Install Python Dependencies:
- Run pip install -r requirements.txt to install the required Python packages, including docling, which is used for text processing and analysis.
Note:
- The pdf for the papers processed are stored in the papers/ directory, and the corresponding markdowns are stored in the markdown/ directory. A few of these have been added as a starting point.

Usage

In order to get started, you should first populate the DATABASE and GEMINI_API_KEY in the `.env.

There are 2 user-facing scripts -

store.py:
- This script is used for processing today's papers and storing them in the graph db.
- To process the papers already provided in the papers/ directory, you can run this script with --existing.
- The data should now be available in the database.
retrieve.py: This script allows you to interact with the db and explore the relationships between papers, concepts, and clusters.
- Example Queries:
  - "What's paper all about"
  - "List various approaches related to "
- Exiting the Loop: Type "exit" / "q" / "quit" to end the loop.

Images

Here's a visualization of the knowledge graph generated from the existing papers in the papers/ directory:

The graph shows papers (orange), concepts (blue), and clusters (green). This visualization helps understand the connections between research areas.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
markdown		markdown
papers		papers
.env		.env
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
db.py		db.py
graph.png		graph.png
llm.py		llm.py
models.py		models.py
prompts.py		prompts.py
requirements.txt		requirements.txt
retrieve.py		retrieve.py
store.py		store.py
tests.py		tests.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Paper Assistant: Organize and Explore Research Papers.

Purpose

Setup

Usage

Images

About

Uh oh!

Languages

License

vedpatwardhan/paper-assistant

Folders and files

Latest commit

History

Repository files navigation

Paper Assistant: Organize and Explore Research Papers.

Purpose

Setup

Usage

Images

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages