All technological notes.
Artificial intelligence (AI):
Machine learning (ML):
artificial intelligence (AI) that enables machines to use data and algorithms to simulate human behaviorDeep Learning(DL):
Machine learning(ML) which use neural networks to simulate human behaviorGenerative AI
deep-learning models that can generate highquality text, images, videosalgorithm is trained using the historical data to make predictions on new data.
3 Categories:
Supervised Learning
labelled dataset, tested using test dataClassification: discrete categoryRegression: a continuous numerical valueUnsupervised Learning
unlabeled datasetClustering: groups similar data points (rows) into clustersAssociation: finds rules and relationships between variables (items) that frequently occur togetherReinforcement Learning
Deep Learning - focuses on building artificial neural networks that can learn from data.Artificial Neural networks
focuses on generating high-quality text, images, code
Inference: Act of using model to generate text is called Inference
Prompt: Input provided to ModelTokenizer: breaks down raw text into smaller, manageable units called tokens
token: the fundamental, small units of data—words, characters, or subwords—that AI models use to process and generate language.token id: a unique integer assigned to a specific token (a word, part of a word, or character) from a model’s pre-defined vocabulary.context window/working memory:
input prompt + output response) an Large Language Model (LLM) can process at one time, measured in tokens.Max tokens: the upper limit on tokens (text fragments) a model can handle in one request, including both input prompt and output responsestop sequence user-defined string of characters, words, or tokens that signals a large language model (LLM) to cease text generation immediately upon producing itModel/Foundation Model: GPT, Claudedetokenizer
tokenization, converting a sequence of numerical tokens (IDs) or split subwords generated by a language model back into readable, natural language textCompletion: Output of ModelFMs as the base for building task-specific models and Adapt themLarge Language Models (LLM):
Small Language Model (SLM):
Large Language Models (LLM’s):
Image Generation or Diffusion Model
Multi-Modal Model
Embedding Model