NICT QE/APE Dataset

Introduction

NICT QE/APE Dataset is a multilingual parallel corpus consisting of transcribed utterances in Japanese and their MT outputs in several languages, manually associated with their gradings and post-edits. The dataset is developed for training and evaluating systems for the following four tasks.

Features

News

Download

Todo

References

Precautions

License

Creative Commons License

Use and/or redistribution of the NICT QE/APE Dataset is permitted under the conditions of Creative Commons Attribution-NonCommercial-ShareAlike License 4.0.

Acknowledgments

The dataset has been developed at Advanced Translation Technology Laboratory, Advanced Speech Translation Research and Development Promotion Center, National Institute of Information and Communications Technology under the program "Promotion of Global Communications Plan: Research, Development, and Social Demonstration of Multilingual Speech Translation Technology" of the Ministry of Internal Affairs and Communications (MIC), Japan.