_________ is a field of computer science dedicated to solving
problems that we commonly associate with human intelligence
Artificial Intelligence
Used to generate new data that is similar to the data it was trained on
* Text
* Image
* Audio
* Code
* Video…
Generative AI
To generate data, we must rely on a __________
* ___________ are trained on a wide variety of input data
* The models may cost tens of millions of dollars to train
Foundation Model
Type of AI designed to generate coherent human-like text
* One notable example: GPT-4 (ChatGPT / Open AI)
* Trained on large corpus of text data
* Usually very big models
* Billions of parameters
* Trained on books, articles, websites, other textual data
* Can perform language-related tasks
* Translation, Summarization
* Question answering * Content creatio
Large Language Models (LLM)
We usually interact with the LLM by giving a ____
prompt
What is the term for below: the generated text may be different for every user that uses
the same prompt
Non-deterministic:
What’s Amazon Titan?
What term goes with this:
-Adapt a copy of a foundation model with your own data
Fine Tuning
*Improves the performance of
a pre-trained FM on domain-specific tasks
* = further trained on a
particular field or area of
knowledge
Instruction based fine tuning
-make a model expert in a specific domain
* For example: feeding the entire AWS
documentation to a model to make it an expert on AWS
* Good to feed industry-specific terminology
into a model (acronyms, etc…)
* Can continue to train the model as more
data becomes available
domain-adaptation fine-tuning
single turn messaging
multi turn messaging
True or false: Instruction-based fine-tuning is usually cheaper than re training an FM as computations are
less intense and the amount of data required usually less
true
_________ the broader concept of re-using a pre-trained model to adapt it to a new related task
* Widely used for image classification
* And for NLP (models like BERT and GPT)
transfer learning
This is a good use case of _____
fine tuning
What does it mean to automatically evaluate a model?
Evaluate a model for quality control.
Scores are calculated automatically
What does it mean to have human evaluation of a model?
benchmark datasets
_________
* Semantic similarity between generated text
* Uses pre-trained ___ models (Bidirectional Encoder Representations from Transformers) to compare the
contextualized embeddings of both texts and computes the cosine similarity between them.
* Capable of capturing more nuance between the texts
BLEU: Bilingual Evaluation Understudy
Evaluating automatic summarization and machine translation systems
* ____-N – measure the number of matching n-grams between reference and generated text
* _____–L – longest common subsequence between reference and generated text
search & analytics database
real time similarity queries, store millions of vector embeddings
scalable index management, and fast nearest-neighbor (kNN) search capability
Amazon OpenSearch Service
[with MongoDB compatibility] – NoSQL database
real time similarity queries, store millions of vector embeddings
Amazon DocumentDB