SkillAgentSearch skills...

Codesearch.AI

codesearch.ai semantic code search engine

Install / Use

/learn @sourcegraph/Codesearch.AI
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

codesearch.ai

codesearch.ai is a semantic code search engine. It allows searching GitHub functions and StackOverflow answers using natural language queries. It uses HuggingFace Transformers under the hood, and the training procedure is inspired by a paper called Text and Code Embeddings by Contrastive Pre-Training from OpenAI. The CodeSearchNet project served as a basis for data collection and cleaning.

The project is split into two sub-projects: data collection and model training. The codesearch-ai-data folder corresponds to the data collection part written in Go. And the codesearch_ai_ml folder corresponds to the model training part written in Python.

Requirements

  • Go >= 1.18
  • Python >= 3.7
  • CUDA (for GPU model training)
  • Postgres

Code walkthrough

We prepared a detailed code walkthrough in the form of a Sourcegraph Notebook.

View on GitHub
GitHub Stars42
CategoryDevelopment
Updated1mo ago
Forks10

Languages

Go

Security Score

90/100

Audited on Feb 13, 2026

No findings