Comparison of popular models and architectures

Related to:


Table - Single models with options

Detailed Breakdown of Popular Models and Architectures

Autoencoders

Category Details
Basic Components Encoder, Decoder, Latent Space
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Reconstruction Loss
Model’s Parameters Weights (Changeable), Latent Dimensions (Fixed)
Criteria of Measuring Parameter’s Productivity Reconstruction Accuracy
Model’s Hyperparameters Learning Rate, Latent Dimension Size (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Lower Reconstruction Error
Basic Components of Hyperparameter’s Productivity Effective Latent Space Size, Training Convergence Rate

CNN (Convolutional Neural Networks)

Category Details
Basic Components Convolution Layers, Pooling, Fully Connected Layers
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Accuracy, Precision, Recall, F1-Score
Model’s Parameters Filter Weights (Changeable), Input Channels (Fixed)
Criteria of Measuring Parameter’s Productivity Detection Accuracy
Model’s Hyperparameters Kernel Size, Stride, Number of Filters (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Higher Feature Extraction Quality
Basic Components of Hyperparameter’s Productivity Filter Efficiency, Computational Cost

RNN (Recurrent Neural Networks)

Category Details
Basic Components Recurrent Layers, Activation Functions
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Perplexity, Accuracy, BLEU Score (NLP)
Model’s Parameters Hidden State (Changeable), Sequence Length (Fixed)
Criteria of Measuring Parameter’s Productivity Temporal Pattern Capture Efficiency
Model’s Hyperparameters Learning Rate, Hidden State Size (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Sequence Learning Performance
Basic Components of Hyperparameter’s Productivity Effective Sequence Memory Size

LSTM (Long Short-Term Memory Networks)

Category Details
Basic Components LSTM Cells (Input, Forget, Output Gates)
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Perplexity, Accuracy (Time-Series, NLP)
Model’s Parameters Cell Weights (Changeable), Memory Cell (Fixed)
Criteria of Measuring Parameter’s Productivity Long-Term Dependency Capture Efficiency
Model’s Hyperparameters Learning Rate, Number of Layers (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Retention of Long-Term Dependencies
Basic Components of Hyperparameter’s Productivity Sequence Retention and Gradient Stability

GNN (Graph Neural Networks)

Category Details
Basic Components Node Embeddings, Edge Features, Graph Convolutions
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Node Classification Accuracy, Link Prediction
Model’s Parameters Edge Weights (Changeable), Node Attributes (Fixed)
Criteria of Measuring Parameter’s Productivity Graph Feature Capture Efficiency
Model’s Hyperparameters Number of Layers, Embedding Size (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Graph-Level Feature Generalization
Basic Components of Hyperparameter’s Productivity Graph Topology Learning

BERT (Bidirectional Encoder Representations from Transformers)

Category Details
Basic Components Encoder, Multi-Head Attention, Feedforward Layers
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity F1-Score, Exact Match (QA), Perplexity
Model’s Parameters Token Embeddings (Changeable), Vocabulary (Fixed)
Criteria of Measuring Parameter’s Productivity Contextual Understanding Quality
Model’s Hyperparameters Learning Rate, Batch Size, Sequence Length (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Contextual Embedding Accuracy
Basic Components of Hyperparameter’s Productivity Attention Mechanism, Positional Encoding

BART (Bidirectional and Auto-Regressive Transformers)

Category Details
Basic Components Encoder-Decoder, Multi-Head Attention, Feedforward
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Rouge Score, BLEU Score
Model’s Parameters Attention Weights (Changeable), Vocabulary (Fixed)
Criteria of Measuring Parameter’s Productivity Summarization and Translation Accuracy
Model’s Hyperparameters Learning Rate, Number of Heads (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Text Generation Quality
Basic Components of Hyperparameter’s Productivity Encoder-Decoder Consistency

T5 (Text-to-Text Transfer Transformer)

Category Details
Basic Components Encoder-Decoder, Attention Mechanisms, Feedforward
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Rouge Score, BLEU Score
Model’s Parameters Token Embeddings (Changeable), Vocabulary (Fixed)
Criteria of Measuring Parameter’s Productivity Text-to-Text Conversion Accuracy
Model’s Hyperparameters Sequence Length, Beam Width (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Text Generation Coherence
Basic Components of Hyperparameter’s Productivity Attention Span, Latent Representation Quality

LLAMA

Category Details
Basic Components Transformer Layers, Feedforward Layers, Attention
Open-Source/ Forbidden Restricted for Modifications
Criteria of Measuring Productivity F1-Score, Rouge Score
Model’s Parameters Attention Weights (Changeable), Vocabulary (Fixed)
Criteria of Measuring Parameter’s Productivity Latent Representation Consistency
Model’s Hyperparameters Number of Layers, Head Size (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Layer-to-Layer Weight Propagation
Basic Components of Hyperparameter’s Productivity Transformer Block Efficiency

GPT (Generative Pre-trained Transformer)

Category Details
Basic Components Transformer Decoder, Feedforward Layers, Attention
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Perplexity, BLEU Score
Model’s Parameters Attention Weights (Changeable), Vocabulary (Fixed)
Criteria of Measuring Parameter’s Productivity Generative Text Coherence
Model’s Hyperparameters Learning Rate, Model Depth, Token Limit (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Generative Text Quality
Basic Components of Hyperparameter’s Productivity Token Context Understanding

ViT (Vision Transformer)

Category Details
Basic Components Patch Embedding, Transformer Layers, Attention
Open-Source/ Forbidden Open-Source
Criteria of Measuring Productivity Accuracy, Precision, Recall, F1-Score
Model’s Parameters Patch Embeddings (Changeable), Image Size (Fixed)
Criteria of Measuring Parameter’s Productivity Visual Feature Generalization
Model’s Hyperparameters Patch Size, Attention Heads (Changeable)
Criteria of Measuring Hyperparameter’s Productivity Patch Extraction Accuracy, Attention Span
Basic Components of Hyperparameter’s Productivity Image Feature Learning Efficiency