MemeCap: A Dataset for Captioning and Interpreting Memes |
|
|
➖ |
Incorporating Structured Representations into Pretrained Vision & Language Models using Scene Graphs |
➖ |
|
➖ |
From Wrong to Right: A Recursive Approach Towards Vision-Language Explanation |
|
|
➖ |
Variance Matters: Detecting Semantic Differences without Corpus/Word Alignment |
|
|
➖ |
Improving Transformer-based Program Repair Model through False Behavior Diagnosis |
|
|
➖ |
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality |
➖ |
|
➖ |
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data |
|
|
➖ |
AutoTrial: Prompting Language Models for Clinical Trial Design |
➖ |
|
➖ |
POE: Process of Elimination for Multiple Choice Reasoning |
|
|
➖ |
MProto: Multi-Prototype Network with Denoised Optimal Transport for Distantly Supervised Named Entity Recognition |
|
|
➖ |
Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network |
|
|
➖ |
DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning |
➖ |
|
➖ |
Generative Table Pre-Training Empowers Models for Tabular Prediction |
|
|
➖ |
Spoiler Detection as Semantic Text Matching |
|
|
➖ |
Prompting Scientific Names for Zero-Shot Species Recognition |
➖ |
|
➖ |
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests |
|
|
|
Image Manipulation via Multi-Hop Instructions - A New Dataset and Weakly-Supervised Neuro-Symbolic Approach |
➖ |
|
➖ |
q2d: Turning Questions into Dialogs to Teach Models how to Search |
➖ |
|
➖ |
The Benefits of Label-Description Training for Zero-Shot Text Classification |
|
|
➖ |
GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree |
➖ |
|
➖ |
Pre-Training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification |
|
|
➖ |
IC3: Image Captioning by Committee Consensus |
|
|
➖ |
Towards Conceptualization of "Fair Explanation": Disparate Impacts of Anti-Asian Hate Speech Explanations on Content Moderators |
|
|
➖ |
Contrastive Learning for Inference in Dialogue |
|
|
➖ |
Post-Hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations |
|
|
➖ |
Content- and Topology-Aware Representation Learning for Scientific Multi-Literature |
|
|
➖ |
Can Language Models Understand Physical Concepts? |
|
|
➖ |
GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations |
|
|
➖ |
Continual Named Entity Recognition without Catastrophic Forgetting |
|
|
➖ |
A Generation-based Deductive Method for Math Word Problems |
|
|
➖ |
GNAT: A General Narrative Alignment Tool |
➖ |
|
➖ |
Analyzing Film Adaptation through Narrative Alignment |
➖ |
|
➖ |
DALE: Generative Data Augmentation for Low-Resource Legal NLP |
|
|
➖ |
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models |
|
|
➖ |
ChatEdit: Towards Multi-Turn Interactive Facial Image Editing via Dialogue |
|
|
➖ |
Towards LLM-Driven Dialogue State Tracking |
|
|
➖ |
Turn-Level Active Learning for Dialogue State Tracking |
➖ |
|
➖ |
StructGPT: A General Framework for Large Language Model to Reason over Structured Data |
|
|
➖ |
Towards Low-Resource Automatic Program Repair with Meta-Learning and Pretrained Language Models |
|
|
➖ |
Semi-Supervised Multimodal Coreference Resolution in Image Narrations |
➖ |
|
➖ |
Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable Rumor Analysis on Social Media |
|
|
➖ |
VLIS: Unimodal Language Models Guide Multimodal Language Generation |
|
|
➖ |
Event Ontology Completion with Hierarchical Structure Evolution Networks |
|
|
➖ |
LLM-Powered Data Augmentation for Enhanced Cross-Lingual Performance |
|
|
➖ |
Multi-Level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation |
|
|
➖ |
End-to-End Task-Oriented Dialogue: A Survey of Tasks, Methods, and Future Directions |
|
|
➖ |
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning |
|
|
➖ |
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation |
➖ |
|
➖ |
Multi-Source Probing for Open-Domain Conversational Understanding |
➖ |
|
➖ |
Enhancing Textbooks with Visuals from the Web for Improved Learning |
|
|
➖ |
Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems |
➖ |
|
➖ |
Evaluating Object Hallucination in Large Vision-Language Models |
|
|
➖ |
DueT: Image-Text Contrastive Transfer Learning with Dual-Adapter Tuning |
|
|
➖ |
BioT5: Enriching Cross-Modal Integration in Biology with Chemical Knowledge and Natural Language Associations |
|
|
➖ |
Target-Oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation |
|
|
➖ |
Multi-Source Multi-Type Knowledge Exploration and Exploitation for Dialogue Generation |
|
|
➖ |
EDIS: Entity-Driven Image Search over Multimodal Web Content |
|
|
➖ |
Fine-Grained Conversational Decoding via Isotropic and Proximal Search |
|
|
➖ |
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation |
|
|
➖ |
Reinforced Target-Driven Conversational Promotion |
➖ |
|
➖ |
Polar Ducks and where to Find them: Enhancing Entity Linking with Duck Typing and Polar Box Embeddings |
|
|
|
ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts |
|
|
|
Did You Mean ...? Confidence-based Trade-Offs in Semantic Parsing |
|
|
|
How do Languages Influence each Other? Studying Cross-Lingual Data Sharing During LM Fine-Tuning |
|
|
|
Text Embeddings Reveal (Almost) as much as Text |
|
|
|
Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement |
|
|
|
An Expression Tree Decoding Strategy for Mathematical Equation Generation |
|
|
|
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases |
|
|
|
APoLLo: Unified Adapter and Prompt Learning for Vision Language Models |
|
|
|
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney |
|
|
|
PALS: Personalized Active Learning for Subjective Tasks in NLP |
|
|
|
Symbolic Planning and Code Generation for Grounded Dialogue |
|
|
|
Global Voices, Local Biases: Socio-Cultural Prejudices Across Languages |
|
|
|
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models |
|
|
|
Transfer-Free Data-Efficient Multilingual Slot Labeling |
|
|
|
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems |
|
|
|
DADA: Dialect Adaptation via Dynamic Aggregation of Linguistic Rules |
|
|
|
QA-NatVer: Question Answering for Natural Logic-based Fact Verification |
|
|
|
Can Language Models Laugh at YouTube Short-Form Videos? |
|
|
|
Can Pre-Trained Vision and Language Models Answer Visual Information-Seeking Questions? |
|
|
|
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables |
|
|
|
Causal Reasoning through Two Cognition Layers for Improving Generalization in Visual Question Answering |
|
|
|
Log-FGAER: Logic-Guided Fine-Grained Address Entity Recognition from Multi-Turn Spoken Dialogue |
|
|
|
Document-Level Relationship Extraction by Bidirectional Constraints of Beta Rules |
|
|
|
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following |
|
|
|
Language Model is Suitable for Correction of Handwritten Mathematical Expressions Recognition |
|
|
|
Analyzing Cognitive Plausibility of Subword Tokenization |
|
|
|
An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-Level Event Extraction |
|
|
|
Empirical Study of Zero-Shot NER with ChatGPT |
|
|
|
SAMRank: Unsupervised Keyphrase Extraction using Self-Attention Map in BERT and GPT-2 |
|
|
|
Revisiting Sparse Retrieval for Few-Shot Entity Linking |
|
|
|
Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining |
|
|
|
Evaluating Bias and Fairness in Gender-Neutral Pretrained Vision-and-Language Models |
|
|
|
ACTOR: Active Learning with Annotator-Specific Classification Heads to Embrace Human Label Variation |
|
|
|
Appraising the Potential uses and Harms of LLMs for Medical Systematic Reviews |
|
|
|
Impressions: Visual Semiotics and Aesthetic Impact Understanding |
|
|
|
CLEVR-Implicit: A Diagnostic Dataset for Implicit Reasoning in Referring Expression Comprehension |
|
|
|
Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning |
|
|
|
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages |
|
|
|
ViPE: Visualise Pretty-Much Everything |
|
|
|
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models |
|
|
|
Unifying Cross-Lingual Transfer Across Scenarios of Resource Scarcity |
|
|
|
NeuSTIP: A Neuro-Symbolic Model for Link and Time Prediction in Temporal Knowledge Graphs |
|
|
|
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models |
|
|
|
Set Learning for Generative Information Extraction |
|
|
|
Confidence-based Ensembling of Perspective-Aware Models |
|
|
|
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion |
|
|
|
Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs |
|
|
|
GROOViST: A Metric for Grounding Objects in Visual Storytelling |
|
|
|
Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue |
|
|
|
S2abEL: A Dataset for Entity Linking from Scientific Tables |
|
|
|
Revisiting the Optimality of Word Lengths |
|
|
|
ZGUL: Zero-Shot Generalization to Unseen Languages using Multi-Source Ensembling of Language Adapters |
|
|
|
Code-Switching Metrics using Intonation Units |
|
|
|
Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks |
|
|
|
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction |
|
|
|
The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models |
|
|
|
Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors |
|
|
|
Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences |
|
|
|
Understanding the Role of Input Token Characters in Language Models: How does Information Loss Affect Performance? |
|
|
|
What else do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems |
|
|
|
HiddenTables and PyQTax: A Cooperative Game and Dataset for TableQA to Ensure Scale and Data Privacy Across a Myriad |
|
|
|
of Taxonomies |
|
|
|
Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers |
|
|
|
Evaluating and Modeling Attribution for Cross-Lingual Question Answering |
|
|
|
Analyzing Modular Approaches for Visual Question Decomposition |
|
|
|
Emergence of Abstract State Representations in Embodied Sequence Modeling |
|
|
|
Task-Agnostic Low-Rank Adapters for Unseen English Dialects |
|
|
|
Abstractive Open Information Extraction |
|
|
|
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans? |
|
|
|
UniChart: A Universal Vision-Language Pretrained Model for Chart Comprehension and Reasoning |
|
|
|
Better Quality Pre-Training Data and T5 Models for African Languages |
|
|
|
Semi-Automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language |
|
|
|
Models |
|
|
|
Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning |
|
|
|
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos |
|
|
|
When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks |
|
|
|
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought |
|
|
|
Struct-XLM: A Structure Discovery Multilingual Language Model for Enhancing Cross-Lingual Transfer through Reinforcement Learning |
|
|
|
GlobalBench: A Benchmark for Global Progress in Natural Language Processing |
|
|
|
Towards Building more Robust NER Datasets: An Empirical Study on NER Dataset Bias from a Dataset Difficulty View |
|
|
|
ALDi: Quantifying the Arabic Level of Dialectness of Text |
|
|
|
Learning to Rank Context for Named Entity Recognition using a Synthetic Dataset |
|
|
|
From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification |
|
|
|
Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages |
|
|
|
Clustering Pseudo Language Family in Multilingual Translation Models with Fisher Information Matrix |
|
|
|
HyperNetwork-based Decoupling to Improve Model Generalization for Few-Shot Relation Extraction |
|
|
|
When Reviewers Lock Horns: Finding Disagreements in Scientific Peer Reviews |
|
|
|
Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot Filling |
|
|
|
Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks |
|
|
|
NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models |
|
|
|
HyperRank: Hyperbolic Ranking Model for Unsupervised Keyphrase Extraction |
|
|
|
A Picture is Worth a Thousand Words: Language Models Plan from Pixels |
|
|
|
Reader: Model-based Language-Instructed Reinforcement Learning |
|
|
|
GenEx: A Commonsense-Aware Unified Generative Framework for Explainable Cyberbullying Detection |
|
|
|
Fine-Grained Medical Vision-Language Representation Learning for Radiology Report Generation |
|
|
|
A Cross-Linguistic Pressure for Uniform Information Density in Word Order |
|
|
|
T2-NER: A Two-Stage Span-based Framework for Unified Named Entity Recognition with Templates |
|
|
|
Testing the Predictions of Surprisal Theory in 11 Languages |
|
|
|
U-CORE: A Unified Deep Cluster-Wise Contrastive Framework for Open Relation Extraction |
|
|
|
Language Varieties of Italy: Technology Challenges and Opportunities |
|
|
|
mGPT: Few-Shot Learners Go Multilingual |
|
|
|
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation |
|
|
|