299 questions · Browse, filter, and study at your own pace
What is catastrophic forgetting and how do you mitigate it in fine-tuning?
What is reranking in RAG?
What is the difference between semantic search and keyword search, and when do you combine them?
What is agent memory and what are the different types?
What is the difference between full fine-tuning and PEFT?
What is speculative decoding and how does it speed up inference?
What is DPO (Direct Preference Optimization) and how does it compare to RLHF?
What is the feed-forward network in a transformer?
What is the HuggingFace Transformers library?
What are the different types of memory in AI agents?
What is Grouped Query Attention (GQA) and why is it used in modern LLMs?
What frameworks are used to build AI agents?
What is Constitutional AI and how does Anthropic use it?
What is the difference between Pinecone and pgvector?
How do you scale an LLM application to 1 million users?
What is GRPO (Group Relative Policy Optimization) and how does it enable LLMs to reason?
What is the difference between SFT and RLHF?
What is the champion-challenger pattern in MLOps?
What tools can AI agents use?