Deep learning based sequence to sequence model for abstractive telugu text summarization

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Related collections

Most cited references 31

Record: found
Abstract: found
Article: not found

Long Short-Term Memory

Jürgen Schmidhuber, Jürgen Schmidhuber (2003)

Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.

0 comments Cited 7768 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Learning long-term dependencies with gradient descent is difficult.

Y Bengio, P. Simard, P Frasconi (1994)

Recurrent neural networks can be used to map input sequences to output sequences, such as for recognition, production or prediction problems. However, practical difficulties have been reported in training recurrent neural networks to perform tasks in which the temporal contingencies present in the input/output sequences span long intervals. We show why gradient based learning algorithms face an increasingly difficult problem as the duration of the dependencies to be captured increases. These results expose a trade-off between efficient learning by gradient descent and latching on information for long periods. Based on an understanding of this problem, alternatives to standard gradient descent are considered.

0 comments Cited 677 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Conference Proceedings: not found

Get To The Point: Summarization with Pointer-Generator Networks

Abigail See, Peter Liu, Christopher D Manning (2017)

0 comments Cited 216 times – based on 0 reviews

Bookmark

All references

Author and article information

Contributors

G. L. Anand Babu: (View ORCID Profile)

Journal

Title: Multimedia Tools and Applications

Abbreviated Title: Multimed Tools Appl

Publisher: Springer Science and Business Media LLC

ISSN (Print): 1380-7501

ISSN (Electronic): 1573-7721

Publication date Created: May 2023

Publication date (Electronic): November 07 2022

Publication date (Print): May 2023

Volume: 82

Issue: 11

Pages: 17075-17096

Article

DOI: 10.1007/s11042-022-14099-x

SO-VID: a65ae3a9-a338-4b2b-9734-339f2cb86d8e

License:

https://www.springernature.com/gp/researchers/text-and-data-mining

History

Data availability:

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.