In this repository, I share some of the problems I faced and my proposed solutions. It is worth mentioning that the code runs on the "Large Movie Review Dataset 1.0", which is freely available at https://www.kaggle.com/datasets/macespinoza/large-movie-review-dataset-10. It would be my pleasure if you shared your experiences with these problems and your ideas about my solutions.
Based on the paper "An Empirical Exploration of Recurrent Network Architectures.pdf", I learned about the significant impact that adding a bias has on improving the convergence of LSTM and GRU networks. I tested this on a text classification task and uploaded the script as "Add Bias to Network".
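For reference, here is a minimal Keras sketch of the relevant switches; the hyperparameters are illustrative and not those of the uploaded script. `use_bias` enables the bias terms, and `unit_forget_bias` additionally initializes the LSTM forget-gate bias to one, which is the trick the paper highlights.

```python
from tensorflow import keras
from tensorflow.keras import layers

# A minimal sketch, not the uploaded script; sizes are illustrative.
model = keras.Sequential([
    layers.Embedding(input_dim=20000, output_dim=64),
    # use_bias turns the bias terms on; unit_forget_bias initializes the
    # forget-gate bias to 1 (both are Keras defaults, shown explicitly here).
    layers.LSTM(64, use_bias=True, unit_forget_bias=True),
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```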
I was curious about the performance of an RNN architecture for text classification when using a vectorization layer inside the network instead of implementing vectorization outside it (the latter approach was used in the "Add Bias to Network" script). My results show almost no difference between the two approaches. I tested this on a text classification task and uploaded the script as "Add Vectorization Layer".
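For illustration, a sketch of the in-network approach using Keras's TextVectorization layer; the settings are illustrative, not necessarily those of the uploaded script.

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

max_tokens, sequence_length = 20000, 250

vectorize = layers.TextVectorization(
    max_tokens=max_tokens,
    output_mode="int",
    output_sequence_length=sequence_length,
)
# The vocabulary must be built from the raw training texts first, e.g.:
# vectorize.adapt(train_texts)

# With the vectorization layer inside, the model accepts raw strings directly.
model = keras.Sequential([
    keras.Input(shape=(1,), dtype=tf.string),
    vectorize,
    layers.Embedding(max_tokens, 64),
    layers.GRU(64),
    layers.Dense(1, activation="sigmoid"),
])
```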
Then, a model was trained (its weights were saved and then loaded) to evaluate the previous script's performance on a text of one's choice. It is worth mentioning that because my system configuration was not powerful enough, I opted for a shorter sequence length to reduce the computational load. This script is uploaded as "Save Model".
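The save/load pattern might look like the sketch below; the file name and toy vocabulary are hypothetical, and the same architecture must be rebuilt before the weights are restored.

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

# A toy, hard-coded vocabulary so the sketch runs end to end; in the real
# script it would come from adapting the layer on the training texts.
VOCAB = ["movie", "great", "bad", "boring", "fun"]

def build_model(sequence_length=50):
    # A shorter sequence_length keeps the computational load down.
    return keras.Sequential([
        keras.Input(shape=(1,), dtype=tf.string),
        layers.TextVectorization(
            output_mode="int",
            output_sequence_length=sequence_length,
            vocabulary=VOCAB,
        ),
        layers.Embedding(len(VOCAB) + 2, 32),  # +2: padding and OOV tokens
        layers.GRU(32),
        layers.Dense(1, activation="sigmoid"),
    ])

model = build_model()
# ... compile and train here, then persist only the weights. The file name is
# hypothetical; some Keras versions require the ".weights.h5" suffix.
model.save_weights("sentiment.weights.h5")

# Later: rebuild the identical architecture and restore the weights.
restored = build_model()
restored.load_weights("sentiment.weights.h5")
print(restored.predict(tf.constant([["a great and fun movie"]])))
```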
In this investigation, I tried to use one-hot encoding instead of an embedding layer to see its effect on the performance of the network. Even though I succeeded in encoding the integer output of the vectorization layer, I could not feed it to the neural network architecture.
Indeed, I faced this error:
Exception encountered when calling layer "gru" (type GRU). Input 'b' of 'MatMul' Op has type float32 that does not match type int32 of argument 'a'.
The script of this part is uploaded as "One Hot Encode".
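For what it is worth, the message suggests the one-hot tensor reached the GRU as int32; casting it to float32 before the recurrent layer is a common remedy. The sketch below (my assumption, not the uploaded script) wraps tf.one_hot in a Lambda layer with an explicit dtype.

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

# Illustrative sizes; a large one-hot depth makes the GRU kernel huge, which
# is exactly why an embedding layer is usually preferred.
max_tokens, sequence_length = 1000, 100

inputs = keras.Input(shape=(sequence_length,), dtype="int32")  # vectorized ids
# tf.one_hot emits float32 by default; the explicit dtype guards against an
# int32 one-hot tensor, which is what the MatMul error above complains about.
one_hot = layers.Lambda(
    lambda ids: tf.one_hot(ids, depth=max_tokens, dtype=tf.float32)
)(inputs)
x = layers.GRU(32)(one_hot)
outputs = layers.Dense(1, activation="sigmoid")(x)
model = keras.Model(inputs, outputs)
model.summary()
```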
In this section, I made a serious effort to do a text classification project from scratch. First, the imported data (accessible via https://www.kaggle.com/code/bindur/amazon-baby-sentiment/notebook) is preprocessed: the problem is turned into a binary classification, rows containing NaN values are removed, the data is split into train (75%) and test (25%) sets, and it is vectorized. Second, for classification, a three-layer architecture consisting of an embedding layer, an LSTM layer, and a fully connected layer is designed. Finally, the predictions are compared with the ground truth using classification_report. The script of this part is uploaded as "Project".
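A condensed sketch of this pipeline follows; the file and column names ("amazon_baby.csv", "review", "rating"), and the way the labels are binarized (dropping neutral 3-star reviews and thresholding), are my assumptions about the linked dataset, not necessarily what the uploaded script does.

```python
import pandas as pd
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report

# File and column names are assumptions; adjust them to the actual dataset.
df = pd.read_csv("amazon_baby.csv")
df = df.dropna(subset=["review", "rating"])       # remove rows containing NaN
df = df[df["rating"] != 3]                        # assumed: drop neutral reviews
df["label"] = (df["rating"] > 3).astype("int32")  # binarize: positive vs. negative

X_train, X_test, y_train, y_test = train_test_split(
    df["review"].values, df["label"].values, test_size=0.25, random_state=42
)
X_train = tf.constant(X_train.astype(str).reshape(-1, 1))
X_test = tf.constant(X_test.astype(str).reshape(-1, 1))

max_tokens, sequence_length = 20000, 100
vectorize = layers.TextVectorization(
    max_tokens=max_tokens, output_mode="int",
    output_sequence_length=sequence_length,
)
vectorize.adapt(X_train)  # build the vocabulary on the training split only

model = keras.Sequential([
    keras.Input(shape=(1,), dtype=tf.string),
    vectorize,
    layers.Embedding(max_tokens, 64),       # embedding layer
    layers.LSTM(64),                        # LSTM layer
    layers.Dense(1, activation="sigmoid"),  # fully connected layer
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X_train, y_train, epochs=3, batch_size=64)

# Compare predictions with the ground truth.
y_pred = (model.predict(X_test) > 0.5).astype("int32").ravel()
print(classification_report(y_test, y_pred))
```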