What are The 5 ML Development Life Cycle Stages
How to identifying opportunities
1. Business Needs
2. Focus on Value Creation:
3. Consider Feasibility:
Examples:
Start with readily available data: If you have existing customer data or operational logs, explore how they can be used to build initial AI models.
Utilize AWS Free Tier: Leverage AWS Free Tier to experiment with AI services and build proof-of-concept projects without significant upfront costs.
Engage with AWS Partner Network: Collaborate with AWS Partners to access specialized expertise and accelerate AI implementation.
**What are the 3 evaluation typs of entripise data ? **
1. Data Aware: Primarily records and stores data, without active use in decision-making; responsibility for data is not clearly defined.
2. Data Informed: Analyzes data to inform decisions, with specific roles assigned for managing data; employs interactive data tools for insights.
3. Data Driven: Integrates data into all decision-making processes, with a company-wide commitment to data; utilizes advanced technologies like AI for strategic actions.
1. Data Aware:
**Capability: **Focus is on the collection of data with an emphasis on knowing what has happened, signifying a basic level of data utilization.
**Ownership: **There’s no designated responsibility for data management, indicating a lack of strategic importance placed on data assets.
**Technology: **Relies on legacy systems (old guard databases) that support simple data storage without advanced analysis or integration capabilities.
2. Data Informed:
**Capability: **Uses insights gained from data analysis to understand why events have occurred, pointing to a more analytical approach to data.
Ownership: Data is managed by designated individuals or teams, reflecting an organizational move towards recognizing data as a valuable asset.
Technology: Incorporates more sophisticated tools like data warehouses and interactive queries, allowing for deeper analysis and pattern recognition.
3. Data Driven:
**Capability: **Prioritizes action based on data analytics, with the aim to influence future outcomes, demonstrating the highest level of data maturity.
**Ownership: **Data is a responsibility shared across the organization, indicating a culture that values and utilizes data in decision-making processes.
Technology: Employs cutting-edge technologies like cloud computing, artificial intelligence, machine learning, and generative AI, enabling real-time analysis and proactive decision-making.
Define High-Impact AI Initiatives (HI-AIs)
It involves:
1. Identifying or recognizing potential AI initiatives (PAIs)
What is Amazon CodeWhisperer?
Amazon CodeWhisperer is an AI-powered coding assistant that provides real-time, context-aware code suggestions to enhance developer productivity
What is ML?
“ML” stands for Machine Learning, which is a branch of artificial intelligence (AI). It involves training algorithms to make predictions or decisions based on data. Machine learning models automatically improve their performance as they are exposed to more data over time. It’s used in a wide range of applications, from recommendation systems in online platforms to autonomous vehicles.
What are the 3 types of ML algorithms?
Examples:
Supervised Learning
* Spam prediction, Fraudulent transaction detectior
* Customer churn prediction
* Machine failure prediction
* Forecasting staffing levels
* Forecasting raw material prices
* Forecasting consumer demand
Unsupervised Learning
* Micro-segmentation of custoers
* Recommendations of products to purchase
* Customer behavior analysis (market basket
What is Deep Learning?
Let’s take the house price prediction problem that we talked about earlier. With neural networks, you can present all possible input -from zip code to neighborhood to the average median price of houses in the city. Neural networks can decide which features are essential and which ones can be excluded or relied on less. These networks will also learn which combination of input can make the best prediction. In contrast, with traditional ML, you’d have to experiment with different combinations of features. Even though a neural network can automatically learn which features to use, for it to work as expected, the network needs to be designed correctly and fed large volumes of high-quality training data.
Provide two types of GenAI models
Transformers
Description: Transformers are an innovative type of neural network architecture that can understand and process the context of words within a sentence or text sequence. They employ mechanisms such as self-attention to assign appropriate weights to the words based on their context within the input sequence.
GANs (Generative Adversarial Networks)
Description: GANs are a type of generative model characterized by their two main components: a generator and a discriminator. The generator attempts to produce synthetic data, such as images, while the discriminator aims to differentiate between real and generated data. GANs are built on an adversarial training paradigm whereby the generator and discriminator engage in a competitive dialogue, resulting in the generator improving its capacity to create more realistic outputs.
How do large language models work?
What are applications of large language models?
“There are many practical applications for LLMs.
Copywriting
Apart from GPT-3 and ChatGPT, Claude, Llama 2, Cohere Command, and Jurassiccan write original copy. AI21 Wordspice suggests changes to original sentences to improve style and voice.
Knowledge base answering
Often referred to as knowledge-intensive natural language processing (KI-NLP), the technique refers to LLMs that can answer specific questions from information help in digital archives. An example is the ability of AI21 Studio playground to answer general knowledge questions.
Text classification
Using clustering, LLMs can classify text with similar meanings or sentiments. Uses include measuring customer sentiment, determining the relationship between texts, and document search.
Code generation
LLM are proficient in code generation from natural language prompts. Examples include Amazon CodeWhisperer and Open AI’s codex used in GitHub Copilot, which can code in Python, JavaScript, Ruby and several other programming languages. Other coding applications include creating SQL queries, writing shell commands and website design.
Text generation
Similar to code generation, text generation can complete incomplete sentences, write product documentation or, like Alexa Create, write a short children’s story.”
What is natural language processing?
How Language Models work?
Understanding Language Models
What is Amazon quickSight (Gen AI serverless for BI)
How Amazon Web Services (AWS) Services for GAN Support?
Amazon Web Services (AWS) Services for GAN Support
What are AWS Deep Learning Services?
AWS Deep Learning Services
- Utilize cloud computing to scale deep learning neural networks cost-effectively and optimize for speed.
- AWS offers specific services for fully managing deep learning applications:
- Amazon Rekognition: Incorporate pretrained or customizable computer vision features into applications.
- Amazon Transcribe: Automatically recognize and transcribe speech accurately.
- Amazon Lex: Build intelligent chatbots proficient in understanding intent, conversational context, and task automation across multiple languages.
- Amazon SageMaker: Provides a swift and straightforward approach to constructing, training, and deploying neural networks at scale for deep learning on AWS.
- AWS Deep Learning AMIs: Create customized environments and workflows for deep learning applications.
- Begin your deep learning journey on AWS by signing up for a free AWS account today!
What are Foundation Models ?
How AWS Can Help with foundation models?
What are examples of foundation models?
Claude
- Claude 2 is Anthropic’s state-of-the-art model that excels at thoughtful dialogue, content creation, complex reasoning, creativity, and coding, built with Constitutional AI. Claude 2 can take up to 100,000 tokens in each prompt, meaning it can work over hundreds of pages of text, or even an entire book. Claude 2 can also write longer documents—like memos and stories on the order of a few thousand tokens—compared to its prior version.
GPT
- The Generative Pre-trained Transformer (GPT) model was developed by OpenAI in 2018.
AI21 Jurassic
- Released in 2021, Jurassic-1 is a 76-layer auto-regressive language model with 178 billion parameters. Jurassic-1 generates human-like text and solves complex tasks. Its performance is comparable to GPT-3.
Amazon Titan
- Exclusive to Amazon Bedrock, the Amazon Titan family of models incorporates Amazon’s 25 years of experience innovating with AI and machine learning across its business. Amazon Titan foundation models (FMs) provide customers with a breadth of high-performing image, multimodal, and text model choices, via a fully managed API.
- Amazon Titan models are created by AWS and pretrained on large datasets, making them powerful, general-purpose models built to support a variety of use cases, while also supporting the responsible use of AI. Use them as is or privately customize them with your own data.
What is Amazon SageMaker designed to help data scientists and developers accomplish?
Amazon SageMaker is designed to help data scientists and developers prepare data, and build, train, and deploy machine learning models quickly by integrating purpose-built capabilities. This allows for the construction of highly accurate models with less effort spent on managing ML environments and infrastructure.
How does SageMaker Data Wrangler assist in feature engineering?
What is the purpose of the SageMaker Feature Store?
The SageMaker Feature Store allows users to save, version, describe, and search for features. This facilitates the sharing and reuse of features across teams, improving collaboration and model development efficiency.
Provide an example of sagemaker implementation
To build a model that creates a musical playlist curated to the listener’s taste using Amazon SageMaker, we can summarize the process in five key points:
How does SageMaker contribute to the productivity and cost-efficiency of data science teams?
With SageMaker, data science teams can achieve up to a 10 times improvement in productivity and a 54% lower total cost of ownership (TCO) compared to other cloud platforms, thanks to its integrated development environment and automated processes