AI Co-Scientist for Protein Function Prediction and Hypothesis Generation

Project Repository Video Presentation Edit

Project Themes:

  • Virtual AI Co-scientist

Team Lead(s):

Suggested Team Members and Roles [4-6 members]

NameAffiliationRole / Expertise
Arvind RamanathanArgonne National LaboratoryPrincipal Investigator, AI Systems Architect
Archit VasanArgonne National LaboratoryModel integration and deployment
Bharat KaleArgonne National LaboratoryUse-case definition and validation
Brian HsuArgonne National LaboratoryModel integration and deployment
Ozan GokdemirArgonne National LaboratoryRAG and Data Integration Specialist
Ruijie ZhuUniversity of ChicagoUse-case definition and validation

Project Summary

This project extends the open-source Co-Scientist framework (https://github.com/acadev/Jnana) to build reasoning agents for protein function prediction, using BV-BRC datasets as testbeds. The goal is to enable hypothesis-driven dialogue and AI-assisted exploration of uncharacterized genes and proteins relevant to priority pathogens.

Goals and Objectives

  • Goal 1: Deploy Co-Scientist framework in the BV-BRC environment
  • Goal 2: Implement a reasoning module for protein function prediction using multi-modal evidence (sequence, structure, literature)
  • Goal 3: Demonstrate AI-guided hypothesis generation for uncharacterized genes in one pathogen model

Approach

The Co-Scientist framework will use an agentic reasoning loop combining retrieval (via HiPerRAG), synthesis (via Rhea), and hypothesis refinement through dialogue. The system will be trained on BV-BRC pathogen data and evaluated for its interpretability and accuracy in protein function annotation.

Data and Resources Required

Resource TypeSource / LinkDescription / Purpose
DataBV-BRC genomes and curated datasetsTraining and evaluation data
Tools / ServicesJnana (Co-Scientist GitHub repo)Agentic reasoning system
LLMs / AI ModelsGPT-4, Claude 3Reasoning and synthesis agents
Compute / StorageArgonne HPCExperimentation environment

Expected Outcomes / Deliverables

Prototype AI Co-Scientist agent capable of hypothesis-driven protein function prediction and reasoning trace visualization.

Potential Impact and Next Steps

This project showcases autonomous scientific reasoning and interactive discovery for infectious disease research. It will help BV-BRC develop transparent, AI-assisted research workflows and foster a foundation for broader agentic science platforms.

Technical Support Needed

  • GPU / LLM access
  • Mentor support
  • API keys

Additional Comments