What are Neural Networks?
A neural network is a computational model inspired by the human brain's structure and function. It is a key component of machine learning and artificial intelligence. Neural networks are designed to recognize patterns and relationships in data by simulating how biological neurons interact.
Basic Structure of a Neural Network
A neural network consists of three main layers:
Input Layer:
- Receives the input data.
- Each neuron in this layer represents a feature or variable from the input data.
Hidden Layers:
- These are intermediate layers between the input and output layers.
- They perform computations, learning features and patterns from the input data.
- The number of hidden layers and the number of neurons in each are hyperparameters, typically chosen through experimentation for the problem at hand.
Output Layer:
- Produces the final output of the network.
- The output depends on the task, such as classification (e.g., predicting a category) or regression (e.g., predicting a continuous value).
The layers are composed of neurons (or nodes). In a fully connected (dense) network, every neuron in one layer is connected to every neuron in the next.
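To make the layer structure concrete, here is a minimal sketch in NumPy (the layer sizes 3, 4, and 2 are arbitrary assumptions for illustration): each layer is just a weight matrix and a bias vector, and data flows through them in sequence.

```python
import numpy as np

rng = np.random.default_rng(0)

# Arbitrary example sizes: 3 input features, 4 hidden neurons, 2 outputs.
W1, b1 = rng.normal(size=(3, 4)), np.zeros(4)  # input -> hidden
W2, b2 = rng.normal(size=(4, 2)), np.zeros(2)  # hidden -> output

def forward(x):
    """One pass through the network: hidden layer, then output layer."""
    h = np.tanh(x @ W1 + b1)  # hidden layer with tanh activation
    return h @ W2 + b2        # raw output (e.g. two regression values)

x = np.array([0.5, -1.0, 2.0])  # one input example with 3 features
print(forward(x).shape)         # -> (2,)
```

Stacking more hidden layers is just adding more weight-matrix/bias pairs between the input and output.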
How Neural Networks Work
Input: Data is fed into the input layer.
Weighted Sum: Each neuron calculates a weighted sum of its inputs:
z = w1*x1 + w2*x2 + ... + wn*xn + b
where:
- wi: weight applied to input xi
- b: bias term
Activation Function: The weighted sum z is passed through an activation function f to introduce non-linearity:
a = f(z)
Common activation functions include:
- Sigmoid: Maps values to (0, 1), useful for probabilities.
- ReLU (Rectified Linear Unit): Sets negative values to 0, introducing sparsity.
- Tanh: Maps values to (-1, 1).
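The three activation functions listed above can be sketched in a few lines of NumPy (tanh ships with NumPy; the other two are one-liners):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))  # squashes any value into (0, 1)

def relu(z):
    return np.maximum(0.0, z)        # zeros out negative values

z = np.array([-2.0, 0.0, 3.0])
print(sigmoid(z))   # values strictly between 0 and 1
print(relu(z))      # -> [0. 0. 3.]
print(np.tanh(z))   # values in (-1, 1)
```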
Propagation:
- Forward Propagation: Input data flows through the layers, and the output is computed.
- Backward Propagation: The network adjusts the weights and biases to minimize the error between predicted and actual outputs.
Loss Function:
- Measures the difference between the network's prediction and the actual target.
- Examples: Mean Squared Error (MSE) for regression, Cross-Entropy Loss for classification.
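Both example losses are short enough to write out directly. This is an illustrative sketch: `mse` averages squared errors for regression, and `cross_entropy` here is the binary form for classification (labels in {0, 1}, predicted probabilities in (0, 1)).

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean Squared Error: average squared difference, for regression."""
    return np.mean((y_true - y_pred) ** 2)

def cross_entropy(y_true, p_pred, eps=1e-12):
    """Binary cross-entropy: penalizes confident wrong probabilities."""
    p = np.clip(p_pred, eps, 1 - eps)  # avoid log(0)
    return -np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))

print(mse(np.array([1.0, 2.0]), np.array([1.5, 2.0])))        # -> 0.125
print(cross_entropy(np.array([1, 0]), np.array([0.9, 0.1])))  # small: predictions are good
```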
Optimization:
- Uses algorithms like Gradient Descent to update weights and biases, minimizing the loss function.
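The whole loop described above (forward pass, loss, backward pass, gradient descent update) can be shown end to end on the simplest possible "network": a single neuron with one weight and one bias, fit to toy data with MSE. The data and learning rate are illustrative assumptions; the gradients are the hand-derived derivatives of MSE.

```python
import numpy as np

# Toy data generated from y = 3x + 1; the model must recover w and b.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 3.0 * x + 1.0

w, b, lr = 0.0, 0.0, 0.05  # initial weight, bias, and learning rate

for _ in range(2000):
    y_pred = w * x + b                       # forward propagation
    grad_w = 2 * np.mean((y_pred - y) * x)   # dMSE/dw (backward step)
    grad_b = 2 * np.mean(y_pred - y)         # dMSE/db
    w -= lr * grad_w                         # gradient descent update
    b -= lr * grad_b

print(round(w, 2), round(b, 2))  # approaches 3.0 and 1.0
```

In a multi-layer network the same idea applies, except the gradients for every weight are computed layer by layer via the chain rule (backpropagation) rather than by hand.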
Types of Neural Networks
Feedforward Neural Networks (FNNs):
- Information flows in one direction from input to output.
- Used for tasks like regression and simple classification.
Convolutional Neural Networks (CNNs):
- Specialized for image data.
- Use convolutional layers to detect spatial patterns.
Recurrent Neural Networks (RNNs):
- Designed for sequential data like time series or text.
- Use loops to retain memory of previous inputs.
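The "loop" in an RNN is simply a hidden state that is fed back in at every time step, using the same weights throughout. A minimal sketch (sizes and random data are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)
# Arbitrary sizes: 2 input features per step, 3 hidden units.
Wx = rng.normal(scale=0.5, size=(2, 3))  # input-to-hidden weights
Wh = rng.normal(scale=0.5, size=(3, 3))  # hidden-to-hidden weights (the "loop")
b = np.zeros(3)

h = np.zeros(3)                      # hidden state: the network's memory
sequence = rng.normal(size=(5, 2))   # a toy sequence of 5 time steps
for x_t in sequence:
    h = np.tanh(x_t @ Wx + h @ Wh + b)  # same weights reused at every step

print(h.shape)  # final hidden state summarizes the whole sequence -> (3,)
```

Repeatedly multiplying by Wh is also where the vanishing gradient problem comes from, which motivates the LSTM below.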
Long Short-Term Memory Networks (LSTMs):
- A type of RNN that mitigates the vanishing gradient problem using gated memory cells.
- Effective for long-term dependencies in sequences.
Generative Adversarial Networks (GANs):
- Consist of two networks: a generator and a discriminator.
- Used for generating realistic data, such as images.
Transformer Networks:
- Powerful for natural language processing (NLP) tasks.
- Based on self-attention mechanisms, e.g., GPT, BERT.
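The core self-attention operation can be sketched in NumPy (a simplified, single-head version with toy sizes; real transformers add multiple heads, masking, and learned projections trained end to end):

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence X (seq_len x d)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv           # queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[1])     # pairwise similarity, scaled
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)  # softmax over each row
    return weights @ V  # each position becomes a weighted mix of all positions

rng = np.random.default_rng(0)
d = 4
X = rng.normal(size=(3, d))  # toy "sequence" of 3 token vectors
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # -> (3, 4)
```

Because every position attends to every other position directly, attention captures long-range dependencies without the step-by-step recurrence of an RNN.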
Applications of Neural Networks
- Image Recognition: Identifying objects in images (e.g., face recognition, medical imaging).
- Natural Language Processing (NLP): Language translation, sentiment analysis, chatbots.
- Speech Recognition: Converting speech to text.
- Time Series Analysis: Predicting stock prices or weather trends.
- Generative Models: Creating realistic images, videos, or music.
- Autonomous Systems: Self-driving cars, robotics.
Strengths and Weaknesses
Strengths:
- Can learn complex non-linear relationships.
- Adaptable to a wide range of problems.
- State-of-the-art performance in many AI domains (e.g., vision, NLP).
Weaknesses:
- Require large datasets to perform well.
- Computationally intensive and time-consuming to train.
- Prone to overfitting without proper regularization.
- Lack interpretability compared to simpler models.
Neural networks have revolutionized modern AI, making it possible to solve problems that were previously considered intractable.