TopRank Skills

Home / Claw Skills / Recherche / ai-engineer
Official OpenClaw rules 54%

ai-engineer

AI/ML engineering specialist for building intelligent features, RAG systems, LLM integrations, data pipelines, vector search, and AI-powered applications. Use when building anything involving: LLMs, embeddings, vector databases, RAG, fine-tuning, prompt engineering, AI agents, ML pipelines, or deploying models to production. NOT for general web dev (use rapid-prototyper) or simple API calls.

Stars

0

Installs

0

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词,发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 ai-engineer 技能。 若已安装,则直接安装 ai-engineer 技能。

Overview

Skill Key
bullkis1/ai-engineer
Author
bullkis1
Source Repo
openclaw/skills
Version
-
Source Path
skills/bullkis1/ai-engineer
Latest Commit SHA
71306517c6d62af8cea18230e8401363d8c22f85

Extracted Content

SKILL.md excerpt

# AI Engineer

Build practical AI systems that work in production. Data-driven, systematic, performance-focused.

## Core Capabilities

- **LLM Integration**: OpenAI, Anthropic, local models (Ollama, llama.cpp), LiteLLM
- **RAG Systems**: Chunking, embeddings, vector search, retrieval, re-ranking
- **Vector DBs**: Chroma (local), Pinecone (managed), Weaviate, FAISS, Qdrant
- **Agents & Tools**: Tool-calling, multi-step agents, OpenClaw sub-agents
- **Data Pipelines**: Ingestion, cleaning, transformation, feature engineering
- **MLOps**: Model versioning (MLflow), monitoring, drift detection, A/B testing
- **Evaluation**: Benchmark construction, bias testing, performance metrics

## Decision Framework

### Which LLM provider?
- **Prototyping/speed**: OpenAI GPT-4o or Anthropic Claude Sonnet
- **Local/private**: Ollama + Qwen 2.5 32B or Llama 3.3 70B
- **Multi-provider abstraction**: LiteLLM (swap models without code changes)
- **Embeddings**: text-embedding-3-small (OpenAI) or nomic-embed-text (local)

### Which vector DB?
- **Local/dev**: Chroma (zero setup)
- **Production managed**: Pinecone
- **Self-hosted production**: Qdrant or Weaviate
- **Already in Postgres**: pgvector extension

### RAG or fine-tuning?
- **RAG first** — always try RAG before fine-tuning. 90% of cases RAG is enough.
- Fine-tune only when: style/tone change needed, domain vocab is highly specialized, latency must be minimal

## RAG Workflow

### 1. Ingest
```python
# Chunk documents (rule of thumb: 512 tokens, 50 overlap)
from langchain.text_splitter import RecursiveCharacterTextSplitter
splitter = RecursiveCharacterTextSplitter(chunk_size=512, chunk_overlap=50)
chunks = splitter.split_documents(docs)
```

### 2. Embed + store
```python
import chromadb
from chromadb.utils.embedding_functions import OpenAIEmbeddingFunction

client = chromadb.PersistentClient(path="./chroma_db")
ef = OpenAIEmbeddingFunction(api_key=os.environ["OPENAI_API_KEY"], model_name="text-embedding-3-small")
collection = c...

Related Claw Skills