Twitter Sentiment Analysis using Neural Networks

This repo contains code to process text, engineer features and perform sentiment analysis on tweets using neural networks. The model is an LSTM, which reaches a testing accuracy of 79%.

Setup

Install Python

Install pyenv for managing Python versions

brew install pyenv

Install Python 3.7.2 with this flag, which points the build at the macOS SDK headers

CFLAGS="-I$(xcrun --show-sdk-path)/usr/include" pyenv install 3.7.2

Get the code

Clone the repo to your machine

git clone https://github.com/kb22/Twitter-Sentiment-Analysis-using-Neural-Networks.git

Move into the folder

cd Twitter-Sentiment-Analysis-using-Neural-Networks

Install all dependencies

pip install -r requirements.txt

Download the dataset

The dataset comes from Kaggle.

Download the file from Kaggle.
Extract the zip and rename the CSV to dataset.csv.
Create a folder named data inside the Twitter-Sentiment-Analysis-using-Neural-Networks folder.
Copy dataset.csv into the data folder.
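
The same folder layout can be prepared from Python. This is a minimal sketch, not part of the repo: the function name place_dataset and the source path are hypothetical, and only the target data/dataset.csv comes from the steps above.

```python
import shutil
from pathlib import Path


def place_dataset(extracted_csv, repo_root="."):
    """Copy the extracted CSV to <repo_root>/data/dataset.csv and return that path."""
    data_dir = Path(repo_root) / "data"
    data_dir.mkdir(parents=True, exist_ok=True)  # create the data folder if it is missing
    target = data_dir / "dataset.csv"
    shutil.copyfile(extracted_csv, target)  # store the file under the expected name
    return target
```

Called from the repo root as place_dataset("path/to/extracted.csv"), it leaves the file where the later scripts are described as expecting it.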

Working with the code

Understanding the data

The Jupyter notebook Dataset analysis.ipynb gives a basic overview of the dataset and analyzes its columns.

Run Jupyter

jupyter notebook

Select the file Dataset analysis.ipynb from the list to see dataset analysis.

Twitter Sentiment Analysis

The project is split across several Python files, one per stage, from splitting the dataset to running the sentiment analysis. The steps to carry out Twitter Sentiment Analysis are:

Run the file train-test-split.py to split the Twitter dataset into training and testing data.

python train-test-split.py
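
For readers who want to see what such a split involves, here is a minimal sketch using only the standard library. It is not the code in train-test-split.py: the function name, the 80/20 ratio and the fixed seed are all assumptions made for illustration.

```python
import random


def train_test_split_rows(rows, test_fraction=0.2, seed=42):
    """Shuffle rows reproducibly and split them into (train, test) lists."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(rows)  # copy so the caller's list is not modified
    random.Random(seed).shuffle(shuffled)  # fixed seed gives the same split every run
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Ten made-up (tweet, label) rows stand in for the real dataset.
rows = [("tweet %d" % i, i % 2) for i in range(10)]
train, test = train_test_split_rows(rows, test_fraction=0.2)
print(len(train), len(test))  # 8 2
```

Shuffling before slicing matters when the source file is ordered by label or date, since a plain head/tail cut would then give unrepresentative training and testing sets.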

Run the file preprocessing.py to process the tweets. It applies these steps:

Remove @user mentions
Remove non-alphabetic characters + spaces + apostrophe
Remove links
Remove single characters
Remove stopwords
Lemmatize words
Stem words

python preprocessing.py
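
The regex-based steps in the list above can be sketched as follows. This is an illustration, not the contents of preprocessing.py. Several choices are assumptions: the tweet is lowercased first, links are removed before other characters so the URL pattern still matches, "non-alphabetic characters + spaces + apostrophe" is read as "keep only letters, spaces and apostrophes", and the stopword list is a tiny hypothetical one. Lemmatizing and stemming are left out because they need an NLP library.

```python
import re

# Tiny illustrative stopword list (hypothetical); a real run would use a fuller one.
STOPWORDS = {"a", "an", "the", "is", "are", "to", "and", "of", "in", "it", "so"}


def clean_tweet(text, stopwords=STOPWORDS):
    """Apply the regex-based cleaning steps to one tweet and return its tokens."""
    text = text.lower()                                 # assumption: case is not kept
    text = re.sub(r"@\w+", " ", text)                   # remove @user mentions
    text = re.sub(r"https?://\S+|www\.\S+", " ", text)  # remove links
    text = re.sub(r"[^a-z' ]", " ", text)               # keep letters, spaces, apostrophes
    tokens = [t for t in text.split() if len(t) > 1]    # remove single characters
    return [t for t in tokens if t not in stopwords]    # remove stopwords


print(clean_tweet("@user The movie is so good!! I loved it http://t.co/abc123 :)"))
# ['movie', 'good', 'loved']
```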

Once the tweets are processed, run lstm.py to train the LSTM on the training data and measure its accuracy on the test data.

python lstm.py
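
An LSTM takes fixed-length sequences of integers rather than raw tokens. The sketch below shows one common way to get there: build a vocabulary from the training tokens, then encode and pad each tweet. Whether lstm.py does exactly this is not stated in this README. The function names, the reserved ids 0 and 1, and the max_words and max_len values are assumptions.

```python
from collections import Counter

PAD, UNK = 0, 1  # reserved ids: padding and out-of-vocabulary tokens


def build_vocab(tokenized_tweets, max_words=5000):
    """Map the most frequent tokens to integer ids starting at 2."""
    counts = Counter(tok for tweet in tokenized_tweets for tok in tweet)
    return {tok: i + 2 for i, (tok, _) in enumerate(counts.most_common(max_words))}


def encode_and_pad(tokens, vocab, max_len=5):
    """Turn tokens into ids, truncate to max_len, and right-pad with PAD."""
    ids = [vocab.get(tok, UNK) for tok in tokens][:max_len]
    return ids + [PAD] * (max_len - len(ids))


# Made-up processed tweets stand in for the output of the preprocessing step.
train_tokens = [["movie", "good", "loved"], ["movie", "bad"]]
vocab = build_vocab(train_tokens)
print(encode_and_pad(["movie", "good"], vocab))
print(encode_and_pad(["movie", "awful"], vocab))  # "awful" is unseen, so it maps to UNK
```

Building the vocabulary from the training split only keeps test-set words from leaking into the model's inputs; unseen test words simply map to the UNK id.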

