User Experience on mobile might not be great yet, but I'm working on it.

Your first time on this page? Allow me to give some explanations.

Awesome Natural Language Generation

A curated list of resources dedicated to Natural Language Generation (NLG)

Here you can see meta information about this topic like the time we last updated this page, the original creator of the awesome list and a link to the original GitHub repository.

Last Update: Dec. 3, 2020, 12:14 a.m.

Thank you tokenmill & contributors
View Topic on GitHub:
tokenmill/awesome-nlg

Search for resources by name or description.
Simply type in what you are looking for and the results will be filtered on the fly.

Further filter the resources on this page by type (repository/other resource), number of stars on GitHub and time of last commit in months.

Datasets

Dataset for NLG which contains preceding context along with each generation instance

22
11
4y 77d
n/a
9
1
2y 100d
Apache-2.0

The enriched version of the WebNLG described at INLG 2018

41
11
1y 89d
n/a

This dataset gathers 728,321 biographies from wikipedia. It aims at evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).

121
19
4y 72d
n/a

The Schema-Guided Dialogue Dataset

263
64
35d
CC-BY-SA-4.0

E2E

This shared task focuses on recent end-to-end (E2E), data-driven NLG methods, which jointly learn sentence planning and surface realisation from non-aligned data.

Computer-generated weather forecasts from weather.gov (US public forecast), along with corresponding weather data.

Company descriptions collected from Wikipedia. The dataset contains semantic representations, short, and long descriptions for 51K companies in English.

YelpNLG provides resources for natural language generation of restaurant reviews.

Dialog

๐ŸŽฏ๐Ÿ—ฏ Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!

645
127
1y 62d
MIT

NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.

326
105
3y 5m
n/a

This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.

866
168
84d
Apache-2.0

RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.

484
133
3y 115d
n/a

Statistical NLG for spoken dialogue systems

177
48
5m
n/a

Evaluation

A tool for holistic analysis of language generations systems

316
49
5d
BSD-3-Clause

Evaluation code for various unsupervised automated metrics for Natural Language Generation.

779
142
5m
n/a

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

291
40
83d
MIT

Grammar

OpenCCG library for parsing and realization with CCG

185
41
1y 4m
LGPL-2.1
50
17
3y 2d
MIT

Combinatory Categorial Grammar (CCG): CCG and probabilistic CCG, with all combinators and their powers.

16
1
9m
n/a

A Web platform for parsing and annotation

5
0
1y 4m
n/a

A programming language for multilingual grammar applications.

Libraries

A .NET library that converts cron expressions into human readable descriptions.

569
145
57d
MIT

Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'.

185
9
10d
n/a

Narrative Generation

Using Natural Language Generation (NLG) to create a random short story.

49
11
8y 6m
n/a

Tracery: a story-grammar generation library for javascript

1.75K
194
5y 7d
Apache-2.0

Neural Natural Language Generation

A robust Python tool for text-based AI training and generation using GPT-2.

830
60
5m
MIT

Graph to sequence implemented in Pytorch combining Graph convolutional networks and opennmt-py

125
24
1y 4m
MIT

[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow

138
58
1y 3d
BSD-3-Clause

Code for PaperRobot: Incremental Draft Generation of Scientific Ideas

355
112
8m
MIT

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

593
103
64d
Apache-2.0

Neural question generation using transformers

229
65
19d
MIT

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

2.06K
335
4m
Apache-2.0

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

4.16K
661
4m
n/a

This Word Does Not Exist

609
54
6m
MIT

๐Ÿค—Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

37.4K
9.15K
3d
Apache-2.0

Implementation of the paper -> https://arxiv.org/abs/1709.00155. For converting information present in the form of structured data into natural language text

165
55
1y 8m
MIT

Papers and Articles

Products

Accelerated Text helps you to automatically generate natural language descriptions of your data, varying in wording and structure.

199
17
3d
n/a

An open-source library for node.js or client side (browser) execution, based on the Pug template engine, to generate texts in English, French, German and Italian.

An open-source tool for telling interactive, nonlinear stories.

Realizers

Surface realiser (part of a Natural Language Generation system) using Tree Adjoining Grammar

22
2
5y 7m
GPL-2.0

A JavaScript bilingual text realizer for web development

11
4
84d
Apache-2.0

Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeenโ€™s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version.

687
170
70d
n/a

German version of SimpleNLG 4

7
0
8m
n/a

SimpleNLG-EnFr 1.1 is a bilingual English/French adaption of SimpleNLG v4.2

23
10
2y 7m
n/a

Templating Languages

A Ruby library for generating text with recursive template grammars.

48
2
10d
MIT

Natural language generation language

47
24
2y 8m
MIT

Java template engine (with ports for C##, Objective-C, JavaScript, Scala) for generating source code, web pages, emails, or any other formatted text output.

Videos