What are vector embeddings? How do they play a part in RAG?
Embedding is the process by which text, images, and audio are given numerical representation in a vector space. Embedding is usually performed by a machine learning (ML) model.
In RAG, enterprise datasets, such as documents, images and audio, are passed to ML models as tokens and are vectorized. These vectors in an n-dimensional space, along with the metadata about them, are stored in purpose-built vector databases for faster retrieval.
What is a vector database?
What are Agents in an AI system?
Agents interact with the environment to perform intermediary operations
Coordinate multi-step functions
Example of an agent:
A chatbot may have an agent to modify/reset a customer’s password or phone plan
Another agent may send a CSAT survey to the customer when the conversation ends.
How do you evaluate a Gen AI system?
What is involved in creating a benchmark dataset?
SMEs have to do this manually.
They create intelligent questions.
Then they craft answers for them.
These datasets are then used to judge the performance of the model.
A “judge model” could be used to automate this process - i.e. a Judge Model takes the output of the model under evaluation and compares it to the benchmark dataset created by the SME and issue a grading score.
What are the benefits of fine tuning?
What are the different types of FT?
What are the key steps in data preparation for fine tuning?
What are a few standard metrics for evaluating LLMs?