Academy & Industry Research Collaboration Center (AIRCC)

Volume 12, Number 19, November 2022

Comparison of Sequence Models for Text Narration from Tabular Data

  Authors

Mayank Lohani, Rohan Dasari, Praveen Thenraj Gunasekaran, Selvakuberan Karuppasamy and Subhashini Lakshminarayanan, Data and AI, Advance Technology Centers in India & Accenture, Gurugram, India

  Abstract

This paper demonstrates our work on the survey of pre-trained transformer models for text narration from tabular data. Understanding the meaning of data from tables or any other data source requires human effort and time to interpret the content. In this era of internet where data is exponentially growing and massive improvement in technology, we propose an NLP (Natural Language Processing) based approach where we can generate the meaningful text from the table without the human intervention. In this paper we propose transformer-based models with the goal to generate natural human interpretable language text generated from the input tables. We propose transformer based pre-trained model that is trained with structured and context rich tables and their respective summaries. We present comprehensive comparison between different transformer-based models and conclude with mentioning key points and future research roadmap.

  Keywords

Survey, NLP (Natural Language Processing), Transformers, Table to Text.