summary/summaries/Week-10 at master · gopala-kr/summary


2016-12

A recurrent neural network without Chaos [arXiv]
Language Modeling with Gated Convolutional Networks [arXiv]
Learning from Simulated and Unsupervised Images through Adversarial Training [arXiv]
How Grammatical is Character-level Neural Machine Translation? Assessing MT Quality with Contrastive Translation Pairs [arXiv]
Improving Neural Language Models with a Continuous Cache [arXiv]
DeepMind Lab [arXiv] [code]
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning [arXiv]
Overcoming catastrophic forgetting in neural networks [arXiv]

2016-11 (ICLR Edition)

Image-to-Image Translation with Conditional Adversarial Networks [arXiv]
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer [OpenReview]
Learning to reinforcement learn [arXiv]
A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs [arXiv]
Adversarial Training Methods for Semi-Supervised Text Classification [arXiv]
Importance Sampling with Unequal Support [arXiv]
Quasi-Recurrent Neural Networks [arXiv]
Capacity and Learnability in Recurrent Neural Networks [OpenReview]
Unrolled Generative Adversarial Networks [OpenReview]
Deep Information Propagation [OpenReview]
Structured Attention Networks [OpenReview]
Incremental Sequence Learning [arXiv]
Delving into Transferable Adversarial Examples and Black-box Attacks [arXiv] [code]
b-GAN: Unified Framework of Generative Adversarial Networks [OpenReview]
A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks [OpenReview]
Categorical Reparameterization with Gumbel-Softmax [arXiv]
Lip Reading Sentences in the Wild [arXiv]

Reinforcement Learning

Learning to reinforcement learn [arXiv]
A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models [arXiv]
The Predictron: End-To-End Learning and Planning [OpenReview]
Third-Person Imitation Learning [OpenReview]
Generalizing Skills with Semi-Supervised Reinforcement Learning [OpenReview]
Sample Efficient Actor-Critic with Experience Replay [OpenReview]
Reinforcement Learning with Unsupervised Auxiliary Tasks [arXiv]
Neural Architecture Search with Reinforcement Learning [OpenReview]
Towards Information-Seeking Agents [OpenReview]
Multi-Agent Cooperation and the Emergence of (Natural) Language [OpenReview]
Improving Policy Gradient by Exploring Under-appreciated Rewards [OpenReview]
Stochastic Neural Networks for Hierarchical Reinforcement Learning [OpenReview]
Tuning Recurrent Neural Networks with Reinforcement Learning [OpenReview]
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning [arXiv]
Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning [OpenReview]
Learning to Perform Physics Experiments via Deep Reinforcement Learning [OpenReview]
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU [OpenReview]
Deep Reinforcement Learning for Accelerating the Convergence Rate [OpenReview]
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning [arXiv]
Learning to Compose Words into Sentences with Reinforcement Learning [OpenReview]
Learning to Navigate in Complex Environments [arXiv]
Unsupervised Perceptual Rewards for Imitation Learning [OpenReview]
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic [OpenReview]

Machine Translation & Dialog

Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation [arXiv]
Neural Machine Translation with Reconstruction [arXiv]
Iterative Refinement for Machine Translation [OpenReview]
A Convolutional Encoder Model for Neural Machine Translation [arXiv]
Improving Neural Language Models with a Continuous Cache [OpenReview]
Vocabulary Selection Strategies for Neural Machine Translation [OpenReview]
Towards an automatic Turing test: Learning to evaluate dialogue responses [OpenReview]
Dialogue Learning With Human-in-the-Loop [OpenReview]
Batch Policy Gradient Methods for Improving Neural Conversation Models [OpenReview]
Learning through Dialogue Interactions [OpenReview]
Dual Learning for Machine Translation [arXiv]
Unsupervised Pretraining for Sequence to Sequence Learning [arXiv]

2016-10

Hybrid computing using a neural network with dynamic external memory [nature] [code]
Understanding deep learning requires rethinking generalization [arXiv]
Universal adversarial perturbations [arXiv] [code]
Neural Machine Translation in Linear Time [arXiv] [code]
Professor Forcing: A New Algorithm for Training Recurrent Networks [arXiv]
Learning to Protect Communications with Adversarial Neural Cryptography [arXiv]
Can Active Memory Replace Attention? [arXiv]
Using Fast Weights to Attend to the Recent Past [arXiv]
Fully Character-Level Neural Machine Translation without Explicit Segmentation [arXiv]
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models [arXiv]
Video Pixel Networks [arXiv]
Connecting Generative Adversarial Networks and Actor-Critic Methods [arXiv]
Learning to Translate in Real-time with Neural Machine Translation [arXiv]
Xception: Deep Learning with Depthwise Separable Convolutions [arXiv]
Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search [arXiv]
Pointer Sentinel Mixture Models [arXiv]

2016-09

Towards Deep Symbolic Reinforcement Learning [arXiv]
HyperNetworks [arXiv]
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation [arXiv]
Safe and Efficient Off-Policy Reinforcement Learning [arXiv]
Playing FPS Games with Deep Reinforcement Learning [arXiv]
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient [arXiv]
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks [arXiv]
Energy-based Generative Adversarial Network [arXiv]
Stealing Machine Learning Models via Prediction APIs [arXiv]
Semi-Supervised Classification with Graph Convolutional Networks [arXiv]
WaveNet: A Generative Model For Raw Audio [arXiv]
Hierarchical Multiscale Recurrent Neural Networks [arXiv]
End-to-End Reinforcement Learning of Dialogue Agents for Information Access [arXiv]
Deep Neural Networks for YouTube Recommendations [paper]

2016-08

Semantics derived automatically from language corpora contain human-like biases [arXiv]
Why does deep and cheap learning work so well? [arXiv]
Machine Comprehension Using Match-LSTM and Answer Pointer [arXiv]
Stacked Approximated Regression Machine: A Simple Deep Learning Approach [arXiv]
Decoupled Neural Interfaces using Synthetic Gradients [arXiv]
WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia [arXiv]
Temporal Attention Model for Neural Machine Translation [arXiv]
Residual Networks of Residual Networks: Multilevel Residual Networks [arXiv]
Learning Online Alignments with Continuous Rewards Policy Gradient [arXiv]

2016-07

An Actor-Critic Algorithm for Sequence Prediction [arXiv]
Cognitive Science in the era of Artificial Intelligence: A roadmap for reverse-engineering the infant language-learner [arXiv]
Recurrent Neural Machine Translation [arXiv]
MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [arXiv]
Layer Normalization [arXiv]
Neural Machine Translation with Recurrent Attention Modeling [arXiv]
Neural Semantic Encoders [arXiv]
Attention-over-Attention Neural Networks for Reading Comprehension [arXiv]
sk_p: a neural program corrector for MOOCs [arXiv]
Recurrent Highway Networks [arXiv]
Bag of Tricks for Efficient Text Classification [arXiv]
Context-Dependent Word Representation for Neural Machine Translation [arXiv]
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes [arXiv]

2016-06

Sequence-to-Sequence Learning as Beam-Search Optimization [arXiv]
Sequence-Level Knowledge Distillation [arXiv]
Policy Networks with Two-Stage Training for Dialogue Systems [arXiv]
Towards an integration of deep learning and neuroscience [arXiv]
On Multiplicative Integration with Recurrent Neural Networks [arXiv]
Wide & Deep Learning for Recommender Systems [arXiv]
Online and Offline Handwritten Chinese Character Recognition [arXiv]
Tutorial on Variational Autoencoders [arXiv]
Concrete Problems in AI Safety [arXiv]
Deep Reinforcement Learning Discovers Internal Models [arXiv]
SQuAD: 100,000+ Questions for Machine Comprehension of Text [arXiv]
Conditional Image Generation with PixelCNN Decoders [arXiv]
Model-Free Episodic Control [arXiv]
Progressive Neural Networks [arXiv]
Improved Techniques for Training GANs [arXiv] [code]
Memory-Efficient Backpropagation Through Time [arXiv]
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets [arXiv]
Zero-Resource Translation with Multi-Lingual Neural Machine Translation [arXiv]
Key-Value Memory Networks for Directly Reading Documents [arXiv]
Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation [arXiv]
Learning to learn by gradient descent by gradient descent [arXiv]
Learning Language Games through Interaction [arXiv]
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations [arXiv]
Smart Reply: Automated Response Suggestion for Email [arXiv]
Virtual Adversarial Training for Semi-Supervised Text Classification [arXiv]
Deep Reinforcement Learning for Dialogue Generation [arXiv]
Very Deep Convolutional Networks for Natural Language Processing [arXiv]
Neural Net Models for Open-Domain Discourse Coherence [arXiv]
Neural Architectures for Fine-grained Entity Type Classification [arXiv]
Matching Networks for One Shot Learning [arXiv]
Cooperative Inverse Reinforcement Learning [arXiv] [article]
Gated-Attention Readers for Text Comprehension [arXiv]
End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning [arXiv]
Iterative Alternating Neural Attention for Machine Reading [arXiv]
Memory-enhanced Decoder for Neural Machine Translation [arXiv]
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation [arXiv]
Learning to Optimize [arXiv] [article]
Natural Language Comprehension with the EpiReader [arXiv]
Conversational Contextual Cues: The Case of Personalization and History for Response Ranking [arXiv]
Adversarially Learned Inference [arXiv]
OpenAI Gym [arXiv] [code]
Neural Network Translation Models for Grammatical Error Correction [arXiv]
