Document worth reading: “Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods”

Integration of imaginative and prescient and language duties has seen a giant progress in the newest events attributable to surge of curiosity from multi-disciplinary communities akin to deep learning, laptop computer imaginative and prescient, and pure language processing. In this survey, we give consideration to 10 completely totally different imaginative and prescient and language integration duties in phrases of their draw back formulation, methods, current datasets, evaluation measures, and comparability of outcomes achieved with the corresponding state-of-the-art methods. This goes previous earlier surveys which are each task-specific or focus solely on one sort of seen content material materials i.e., image or video. We then conclude the survey by discussing some attainable future directions for integration of imaginative and prescient and language evaluation. Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods