|
| 1 | +# Classifying Evolutionary Forces in Languages Change |
| 2 | + |
| 3 | +A fundamental problem in research into language and cultural change is the difficulty of |
| 4 | +distinguishing processes of stochastic drift (also known as neutral evolution) from |
| 5 | +processes that are subject to certain selection pressures. In this article, we describe a |
| 6 | +new technique based on Deep Neural Networks, in which we reformulate the detection of |
| 7 | +evolutionary forces in cultural change as a binary classification task. Using Residual |
| 8 | +Networks for time series trained on artificially generated samples of cultural change, we |
| 9 | +demonstrate that this technique is able to efficiently, accurately and consistently learn |
| 10 | +which aspects of the time series are distinctive for drift and selection. We compare the |
| 11 | +model with a recently proposed statistical test, the Frequency Increment Test, and show |
| 12 | +that the neural time series classification system provides a possible solution to some of |
| 13 | +the key problems of this test. |
| 14 | + |
| 15 | +## Data |
| 16 | + |
| 17 | +Code to reconstruct the past-tense data set can be obtained from |
| 18 | +https://github.com/mnewberry/ldrift. To run the past-tense analysis in |
| 19 | +`notebooks/past-tense.ipynb`, save the frequency list under `data/coha-past-tense.txt`. |
| 20 | + |
| 21 | +## Requirements |
| 22 | +All code is implemented in Python 3.7. A detailed list of the requirements to run the code |
| 23 | +can be found in the `requirements.txt` file. |
| 24 | + |
| 25 | +## Training |
| 26 | + |
| 27 | +To train your own models, run `src/train.py` and follow the instructions therein. |
| 28 | + |
| 29 | +--- |
| 30 | +<a rel="license" href="http://creativecommons.org/licenses/by/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by/4.0/88x31.png" /></a><br />This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</a>. |
| 31 | + |
0 commit comments