5 terms
Showing all terms starting with K
A structured repository of information used by AI systems to answer questions, often combined with RAG for accurate retrieval.
A structured representation of real-world entities and their relationships, used to enhance AI reasoning and question answering.
A simple ML algorithm that classifies data points based on the majority class of their k closest neighbours in feature space.
A model compression technique where a smaller student model is trained to mimic the behaviour of a larger teacher model.
Key-value cache used during LLM inference to store previously computed attention states, dramatically speeding up text generation.