Publications

Back to home

2024

  1. Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Suppa, Hila Gonen, Joseph Marvin Imperial, Börje F. Karlsson, Peiqin Lin, Nikola Ljubešić, Lester James Validad Miranda, Barbara Plank, Arij Riabi, Yuval Pinter. Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark In NAACL 2024.
  2. Leon Weber, Robert Litschko, Ekaterina Artemova and Barbara Plank. Donkii: Characterizing and Detecting Errors in Instruction-Tuning Datasets. In LAW EACL 2024 workshop.
  3. Axel Sorensen, Siyao Peng, Barbara Plank and Rob van der Goot. EEVEE: An Easy Annotation Tool for Natural Language Processing. In LAW EACL 2024 workshop.
  4. Cornelia Gruber, Katharina Hechinger, Matthias Aßenmacher, Goeran Kauermann and Barbara Plank. More Labels or Cases? Assessing Label Variation in Natural Language Inference. In Unimplicit EACL 2024 workshop.
  5. Siyao Peng, Zihang Sun, Sebastian Loftus and Barbara Plank. Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations. In Unimplicit EACL 2024 workshop.
  6. Elena Senger, Mike Zhang, Rob van der Goot and Barbara Plank. Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings. In NLP4HR 2024 workshop at EACL 2024.
  7. Joris Baan, Raquel Fernandez, Barbara Plank and Wilker Aziz. Interpreting Predictive Probabilities: Model Confidence or Human Label Variation? In EACL 2024.
  8. Mike Zhang, Rob van der Goot, Min-Yen Kan and Barbara Plank . NNOSE: Nearest Neighbor Occupational Skill Extraction. In EACL 2024.
  9. Mike Zhang, Rob van der Goot and Barbara Plank. Entity Linking in the Job Market Domain. In Findings of the ACL: EACL 2024.
  10. Katya Artemova, Verena Blaschke and Barbara Plank. Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties. In EACL 2024.

2023

  • Shengqiang Zhang, Philipp Wicke, Lütfi Kerem Şenel, Luis Figueredo, Abdeldjallil Naceri, Sami Haddadin, Barbara Plank, Hinrich Schütze. LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation. In RobotLearning@NeurIPS2023.
  • Joris Baan, Nico Daheim, Evgenia Ilia, Dennis Ulmer, Haau-Sing Li, Raquel Fernández, Barbara Plank, Rico Sennrich, Chrysoula Zerva, Wilker Aziz. Uncertainty in Natural Language Generation: From Theory to Applications. arXiv:2307.15703
  • Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández and Barbara Plank. What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability. In EMNLP 2023.
  • Robert Litschko, Max Müller-Eberstein, Rob van der Goot, Leon Weber and Barbara Plank. Establishing Trustworthiness: Rethinking Tasks and Model Evaluation. In EMNLP 2023.
  • Xinpeng Wang and Barbara Plank. ACTOR: Active learning with annotator-specific classification heads to embrace human label variation. In EMNLP 2023.
  • Shanshan Xu, Santosh T.Y.S.S, Oana Ichim, Isabella Risini, Barbara Plank and Matthias Grabmair. From Dissonance to Insights: Dissecting Disagreements in Rationale Dataset Construction for Case Outcome Classification. In EMNLP 2023.
  • Max Müller-Eberstein, Rob van der Goot, Barbara Plank and Ivan Titov. Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training. In EMNLP 2023 Findings. [arXiv] [code] [video]
  • Mike Zhang, Rob van der Goot and Barbara Plank. ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain. In ACL 2023. [arXiv] [code]
  • Xinpeng Wang, Leonie Weissweiler, Hinrich Schütze and Barbara Plank. How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives. In ACL 2023. [arXiv] [code]
  • Leon Weber and Barbara Plank. ActiveAED: A Human in the Loop Improves Annotation Error Detection. In Findings of ACL 2023. [arXiv] [code]
  • Robert Litschko, Ekaterina Artemova and Barbara Plank. Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data. In Findings of ACL 2023. [arXiv] [code]
  • Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot and Barbara Plank. Silver Syntax Pre-training for Cross-Domain Relation Extraction. In ACL Findings 2023. [arXiv] [code]
  • Elisa Leonardelli, Gavin Abercrombie, D. Almanea, Valerio Basile, Tommaso Fornaciari, Barbara Plank, Verena Rieser, Alexandra Uma, Massimo Poesio. SemEval-2023 Task 11: Learning With Disagreements (LeWiDi). In SemEval, ACL 2023 workshop. [arXiv]
  • Verena Blaschke, Hinrich Schütze and Barbara Plank. A Survey of Corpora for Germanic Low-Resource Languages and Dialects. In NoDaLiDa 2023. [pdf] [repository]
  • Katya Artemova and Barbara Plank. Low-resource Bilingual Dialect Lexicon Induction with Large Language Models. In NoDaLiDa 2023. [pdf] [code]
  • Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot and Barbara Plank. Multi-CrossRE A Multi-lingual Multi-Domain Dataset for Relation Extraction. In NoDaLiDa 2023. [arXiv] [pdf] [code & data]
  • Noëmi Aepli, Çağrı Çöltekin, Rob Van Der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubešić, Kai North, Barbara Plank, Yves Scherrer, and Marcos Zampieri. 2023. Findings of the VarDial Evaluation Campaign 2023. In Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023), pages 251–261, Dubrovnik, Croatia. Association for Computational Linguistics. [pdf]
  • Verena Blaschke, Hinrich Schütze and Barbara Plank. Does manipulating tokenization aid cross-lingual transfer? A study on POS tagging for non-standardized languages. In VarDial, EACL 2023 workshop. [pdf] [code]

2022

  • Elisa Bassignana, Max Müller-Eberstein, Mike Zhang, and Barbara Plank. Evidence > Intuition: Transferability Estimation for Encoder Selection. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2022. [pdf] [code]
  • Max Müller-Eberstein, Rob van der Goot Barbara Plank. Spectral Probing. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2022. [pdf] [code]
  • Joris Baan, Wilker Aziz, Barbara Plank and Raquel Fernández. Stop Measuring Calibration When Humans Disagree. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2022. [pdf] [code]
  • Barbara Plank. The "Problem” of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2022. [pdf] [repository]
  • Dennis Ulmer, Elisa Bassignana, Max Müller-Eberstein, Daniel Varab, Mike Zhang, Rob van der Goot, Christian Hardmeier, and Barbara Plank. Experimental Standards for Deep Learning in Natural Language Processing Research. In Findings of the Association for Computational Linguistics: EMNLP 2022. [pdf] [repository]
  • Elisa Bassignana and Barbara Plank. CrossRE: A Cross-Domain Dataset for Relation Extraction. In Findings of the Association for Computational Linguistics: EMNLP 2022. [pdf] [code]
  • Tanja Samardzić, Ximena Gutierrez-Vasques, Rob van der Goot, Max Müller-Eberstein, Olga Pelloni and Barbara Plank. On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers. In CoNLL 2022. [pdf]
  • Kostiantyn Kucher, Nicole Sultanum, Angel Daza, Vasiliki Simaki, Maria Skeppstedt, Barbara Plank, Jean-Daniel Fekete, Narges Mahayar. An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper. In BELIV 2022 workshop at VIS, IEEE. [pdf]
  • Mike Zhang, Kristian Nørgaard Jensen, Rob van der Goot and Barbara Plank. Skill Extraction from Job Postings using Weak Supervision. In RecSys in HR'22: The 2nd Workshop on Recommender Systems for Human Resources, in conjunction with the 16th ACM Conference on Recommender Systems, September 18--23, 2022, Seattle, USA.
  • Dennis Ulmer, Elisa Bassignana, Max Müller-Eberstein, Daniel Varab, Mike Zhang, Christian Hardmeier and Barbara Plank. Experimental Standards for Deep Learning Research: A Natural Language Processing Perspective. In ICLR Workshop on ML Evaluation Standards. [pdf] [video & poster] [repo] Received an Outstanding Paper Award
  • Max Müller-Eberstein, Rob van der Goot and Barbara Plank. Sort by Structure: Language Model Ranking as Dependency Probing. In NAACL 2022. [pdf] [code]
  • Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks and Barbara Plank. SkillSpan: Hard and Soft Skill Extraction from English Job Postings. In NAACL 2022. [pdf] [code]
  • Elisa Bassignana and Barbara Plank. What Do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification. In ACL 2022 Student Research Workshop. [pdf] [code]
  • Max Müller-Eberstein, Rob van der Goot and Barbara Plank. Probing for Labeled Dependency Trees. In ACL 2022. [arXiv] [pdf] [code]
  • Kristian Nørgaard Jensen and Barbara Plank. Fine-tuning vs From Scratch: Do Vision & Language Models Have Similar Capabilities on Out-of-Distribution Visual Question Answering? In LREC 2022. [pdf]
  • Rob van der Goot, Max Müller-Eberstein and Barbara Plank. Frustratingly Easy Performance Improvements for Low-Resource Setups: A Tale on BERT and Segment Embeddings. In LREC 2022. [pdf]
  • Mike Zhang, Kristian Nørgaard Jensen and Barbara Plank. Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning. In LREC 2022. [pdf]
  • Barbara Plank. Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget. In SemEval 2022, NAACL Workshop. [pdf]
  • Andreas Nugaard Holm, Barbara Plank, Dustin Wright, Isabelle Augenstein. Longitudinal Citation Prediction using Temporal Graph Neural Networks. AAAI 2022 Workshop on Scientific Document Understanding (SDU 2022), February 2022. [arXiv]

2021

  • Alexandra Uma, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank and Massimo Poesio. Learning from Disagreement: A Survey. In the Journal for Artificial Intelligence Research (JAIR), Volume 72, December 2021. [pdf]
  • Max Müller-Eberstein, Rob van der Goot and Barbara Plank. How Universal is Genre in Universal Dependencies? In SyntaxFest 2021. [arXiv]
  • Max Müller-Eberstein, Rob van der Goot and Barbara Plank. Genre as Weak Supervision for Cross-lingual Dependency Parsing. In EMNLP 2021. [arXiv] [bib] [code]
  • Mike Zhang and Barbara Plank. Cartography Active Learning. In EMNLP 2021 Findings. [arXiv] [code]
  • Cathrine Damgaard, Paulina Toborek, Trine Eriksen and Barbara Plank. “I’ll be there for you”: The One with Understanding Indirect Answers. The 2nd Workshop on Computational Approaches to Discourse (CODI) at EMNLP 2021. [pdf]
  • Benjamin Ahrentløv Olsen and Barbara Plank. Finding the needle in a haystack: Extraction of Informative COVID-19 Danish Tweets. The 7th Workshop on Noisy User-generated Text (W-NUT) at EMNLP 2021. [pdf] [data]
  • Rob van der Goot, Alan Ramponi, Arkaitz Zubiaga, Barbara Plank, Benjamin Muller, Iñaki San Vicente Roncal, Nikola Ljubešić, Rahmad Mahendra, Talha Çolakoglu, Timothy Baldwin, Tommaso Caselli, Wladimir Sidorenko. MultiLexNorm: A Shared Task on Multilingual Lexical Normalization. In Proceedings of the 7th Workshop on Noisy User-generated Text (W-NUT 2021), EMNLP workshop. [shared task website]
  • Erkut Erdem, Menekse Kuyu, Semih Yagcioglu, Anette Frank, Letitia Parcalabescu, Barbara Plank, Andrii Babii, Oleksii Turuta, Aykut Erdem, Iacer Calixto, Elena Lloret, Elena-Simona Apostol, Ciprian-Octavian Truică, Branislava Šandrih, Albert Gatt, Sanda Martinčić-Ipšić, Gábor Berend, Grăzina Korvel. Neural Natural Language Generation: A Survey on Multilinguality, Multimodality, Controllability and Learning. Accepted to appear in the Journal for Artificial Intelligence Research (JAIR).
  • Rob van der Goot, Ibrahim Sharaf, Aizhan Imankulova, Ahmet Üstün, Marija Stepanović, Alan Ramponi, Siti Oryza Khairunnisa, Mamoru Komachi and Barbara Plank. From Masked-Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding. In NAACL 2021. [pdf] [data (xSID) & code]
  • Michael A. Hedderich, Benjamin Roth, Katherina Kann, Barbara Plank, Alex Ratner, Ditrich Klakow (editors). Proceedings of the First Workshop on Weakly Supervised Learning (WeaSuL) ICLR 2021 Workshop on Weakly Supervised Learning
  • Tommaso Fornaciari, Alexandra Uma, Silviu Paun, Barbara Plank, Dirk Hovy and Massimo Poesio. Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning. In NAACL 2021. [pdf] [code]
  • Rob van der Goot, Ahmet Üstün, Alan Ramponi, Ibrahim Sharaf and Barbara Plank. Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP. In EACL 2021. Received EACL 2021 Oustanding Paper Award (demo track) [pdf] [code]
  • Valerio Basile, Michael Fell, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank, Massimo Poesio and Alexandra Uma. We Need to Consider Disagreement in Evaluation. In ACL-IJCNLP 2021 Workshop on Benchmarking: Past, Present and Future. [pdf]
  • Barbara Plank. Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts. In ACL-IJCNLP 2021 Findings. [pdf] [data]
  • Alexandra Uma, Tommaso Fornaciari, Anca Dumitrache, Tristan Miller, Jon Chamberlain, Barbara Plank, Edwin Simpson and Massimo Poesio. SemEval-2021 Task 12: Learning with Disagreements. In SemEval, co-located with ACL-IJCNLP 2021. [pdf]
  • Barbara Plank. From back to the roots into the gated woods: Deep learning for NLP. In Teaching NLP, NAACL 2021 workshop. [paper]
  • Maria Barrett, Hieu Trong Lam, Martin Wu, Ophélie Lacroix, Barbara Plank and Anders Søgaard. Resources and Evaluations for Danish Entity Resolution. In Fourth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC) at EMNLP 2021. [pdf]
  • Rob van der Goot, Ahmet Üstün and and Barbara Plank. On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions. In the Second Workshop for Domain Adaptation in Natural Language Processing (AdaptNLP), EACL 2021. [paper] [code]
  • Kristian Nørgaard Jensen, Mike Zhang and Barbara Plank. De-identification of Privacy-related Entities in Job Postings. In NoDaLiDa 2021. [paper] [code] [slides] [video]

2020

  • Alan Ramponi and Barbara Plank. Neural Unsupervised Domain Adaptation in NLP---A Survey. [arXiv] [repository]. In COLING 2020.
  • Alexandra Uma, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank. and Massimo Poesio. A Case for Soft Loss Functions. In HCOMP 2020, AAAI press.
  • Barbara Plank, Kristian Nørgaard Jensen and Rob van der Goot. DaN+ - Danish Nested Named Entities and Lexical Normalization. [pdf] [code & data] In COLING 2020.
  • Alan Ramponi, Rob van der Goot, Rosario Lombardo and Barbara Plank. BeeSL: Biomedical Event Extraction as Sequence Labeling . In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020 [paper]
  • Anders Giovanni Møller, Rob van der Goot, Barbara Plank. NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets. In WNUT 2020 workshop, EMNLP. [paper]
  • Alan Ramponi, Barbara Plank, Rosario Lombardo. Cross-Domain Evaluation of Edge Detection for Biomedical Event Extraction. In LREC 2020. [paper] [bib] [data & code]
  • Claudio Greco, Barbara Plank, Raquel Fernández, and Raffaella Bernardi. Measuring Catastrophic Forgetting in Visual Question Answering. To appear in Springer book series.
  • Rob van der Goot, Ahmet Ustün, Alan Ramponi, Barbara Plank. Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP. arXiv:2005.14672, 2020. [paper] [code]
  • Andreas Kirkedal, Marija Stepanovic and Barbara Plank. FT Speech: Danish Parliament Speech Corpus. In Proceedings of Interspeech 2020. [arXiv pre-print] [data & code]
  • Anders Friis Kaas, Viktor Torp Thomsen and Barbara Plank. Team DiSaster at SemEval-2020 Task 11: Combining BERT and hand-crafted features for Identifying Propaganda Techniques in News. In SemEval-2020. [paper] [code]
  • Kristian Nørgaard Jensen, Nicolaj Filrup Rasmussen, Thai Wang, Marco Placenti and Barbara Plank. Buhscitu at SemEval-2020 Task 7: Assessing Humour in Edited News Headlines using Hand-Crafted Features and Online Knowledge Bases. In SemEval-2020. [paper] [code]
  • Claudio Greco, Barbara Plank, Raquel Fernandez and Raffaella Bernardi. Measuring Catastrophic Forgetting in Visual Question Answering. Increasing Naturalness and Flexibility in Spoken Dialogue Interaction: 10th International Workshop on Spoken Dialogue Systems. Springer book chapter.

2019

  • Claudio Greco, Barbara Plank, Raquel Fernández, Raffaella Bernardi. Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering. In ACL 2019. [ArXiV] [ACL Anthology] [project website]
  • Sigrid Klerke and Barbara Plank. At a Glance: The Impact of Gaze Aggregation Views on Syntactic Tagging. In LANTERN, EMNLP 2019 workshop, Hong Kong.
  • Nils Rethmeier and Barbara Plank. MoRTy: Unsupervised Learning of Task-specialized Word Embeddings by Autoencoding. In RepL4NLP, ACL 2019 workshop.
  • Barbara Plank and Reinard van Dalen. CiteTracked: A Longitudinal Dataset of Peer Reviews and Citations. In BIRNDL, SIGIR workshop 2019. [paper] [data (contact me, online soon)]
  • Barbara Plank and Sigrid Klerke. Lexical Resources for Low-Resource PoS Tagging in Neural Times. In NoDaLiDa 2019.
  • Barbara Plank. Cross-Lingual Transfer and Very Little Labeled Data for Named Entity Recognition in Danish. In NoDaLiDa 2019.
  • Andreas Kirkedal, Barbara Plank, Leon Derczynski and Natalie Schluter. The Lacunae of Danish Natural Language Processing. In NoDaLiDa 2019.
  • Claudio Greco, Barbara Plank, Raquel Fernández and Raffaella Bernardi. Measuring Catastrophic Forgetting in Visual Question Answering. In Proceedings of Tenth International Workshop on Spoken Dialogue Systems Technology (IWSDS) 2019.
  • Ravi Shekhar, Aashish Venkatesh, Tim Baumgärtner, Elia Bruni, Barbara Plank, Raffaella Bernardi and Raquel Fernández. A closer look at jointly learning to see, ask, and GuessWhat. In NAACL 2019. [pdf]

2018

  • Barbara Plank and Željko Agić. Distant supervision from disparate sources for low-resource part-of-speech tagging. In Proceedings of EMNLP 2018. [ACL anthology] [arXiv] [code]
  • Martin Kroon, Masha Medvedeva and Barbara Plank. When Simple n-gram Models Outperform Syntactic Approaches: Discriminating between Dutch and Flemish. In Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects. COLING workshop, 2018.
  • Sebastian Ruder and Barbara Plank. Strong Baselines for Neural Semi-supervised Learning under Domain Shift. In ACL 2018, Melbourne, Australia. [arXiv] [code]
  • Rob van der Goot, Nikola Ljubešić, Ian Matroos, Malvina Nissim and Barbara Plank. Bleaching Text: Abstract Features for Cross-lingual Gender Prediction. In ACL 2018, Melbourne, Australia. [arXiv] [code]
  • Katharina Kann, Johannes Bjerva, Isabelle Augenstein, Barbara Plank and Anders Søgaard. Character-level Supervision for Low-resource POS Tagging. In Proceedings of the 1st Workshop on Deep Learning Approaches for Low Resource Natural Language Processing (DeepLo 2018).
  • Barbara Plank. Predicting Authorship and Author Traits from Keystroke Dynamics. In Proceedings of the Second Workshop on Computational Modeling of People's Opinions, Personality, and Emotions in Social Media (PEOPLES). NAACL workshop, 2018.
  • Sigrid Klerke, Héctor Martínez Alonso and Barbara Plank. Grotoco@SLAM: Second language acquisition modeling with simple features, learners and task-wise models. In Proceedings of the NAACL-HLT Workshop on Innovative Use of NLP for Building Educational Applications (BEA), NAACL workshop, 2018.

2017

  • Barbara Plank. All-In-1: Short Text Classification with One Model for All Languages. In IJCNLP 2017 Proceedings of the Shared Task on Customer Feedback Analysis. Won the shared task (ranked 1st out of 12 teams) [arXiv] [code]
  • Sebastian Ruder and Barbara Plank. Learning to select data for transfer learning with Bayesian Optimization. In EMNLP 2017, Copenhagen, Denmark. [arXiv] [code] [poster]
  • Malvina Nissim, Lasha Abzianidze, Kilian Evang, Rob van der Goot, Hessel Haagsma, Barbara Plank and Martijn Wieling. Sharing is Caring: The Future of Shared Tasks. In Computational Linguistics journal (Last Words). [pdf]
  • Rob van der Goot, Barbara Plank and Malvina Nissim. To Normalize, or Not to Normalize: The Impact of Normalization on Part-of-Speech Tagging. In WNUT 2017, EMNLP 2017, Copenhagen, Denmark. [arXiv] [code]
  • Reinder Gerard van Dalen, Léon Redmar Melein and Barbara Plank. Profiling Dutch Authors on Twitter: Discovering Political Preference and Income Level. In Computational Linguistics in the Netherlands (CLIN) journal, 2017. [pdf]
  • Héctor Martínez Alonso and Barbara Plank. When is multitask learning effective? Semantic sequence prediction under varying data conditions. In EACL 2017 (long), Valencia, Spain. [arXiv] [ACL anthology] [bib] [slides] [data]
  • Héctor Martínez Alonso, Željko Agić, Barbara Plank and Anders Søgaard. Parsing Universal Dependencies without training. In EACL 2017 (long), Valencia, Spain. [arXiv] [ACL anthology] [bib] [code]
  • Željko Agić, Barbara Plank and Anders Søgaard. Cross-Lingual Tagger Evaluation Without Test Data. In EACL 2017 (short), Valencia, Spain. [ACL anthology] [bib]
  • Maria Medvedeva, Martin Kroon and Barbara Plank. When Sparse Traditional Models Outperform Dense Neural Networks: the Curious Case of Discriminating between Similar Languages. Proceedings of the 4th Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial4). EACL 2017, Valencia, Spain. [pdf]
  • Artur Kulmizev, Bo Blankers, Johannes Bjerva, Malvina Nissim, Gertjan van Noord, Barbara Plank and Martijn Wieling. The Power of Character N-grams in Native Language Identification. To Appear in BEA 2017, EMNLP workshop, Copenhagen, Denmark.
  • Johannes Bjerva, Gintare Grigonyte, Robert Östling and Barbara Plank. Neural Networks and Spelling Features for Native Language Identification. To Appear in BEA 2017, EMNLP workshop, Copenhagen, Denmark.
  • Raffaella Bernardi, Ruket Cakici, Desmond Elliott, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Frank Keller, Adrian Muscat and Barbara Plank. Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended abstract). In IJCAI, 2017 (journal track), Melbourne, Australia.

2016

  • Barbara Plank and Malvina Nissim. When silver glitters more than gold: Bootstrapping an Italian part-of-speech tagger for Twitter. In CLiC, PoSTWITA 2016 shared task. [arXiv]
  • Barbara Plank. Keystroke dynamics as signal for shallow syntactic parsing. The 26th International Conference on Computational Linguistics (COLING). Osaka, Japan. [arXiv] Awarded finalist for COLING 2016 best paper award
  • Barbara Plank. The side benefit of behavior: using keystroke dynamics to inform Natural Language Processing. WiML 2016 (NIPS workshop) [abstract]
  • Johannes Bjerva, Barbara Plank and Johan Bos. Semantic Tagging with Deep Residual Networks. The 26th International Conference on Computational Linguistics (COLING). Osaka, Japan. [arXiv]
  • Chloe Braud, Barbara Plank and Anders Søgaard. Multi-view and multi-task training of RST discourse parsers. The 26th International Conference on Computational Linguistics (COLING). [pdf]
  • Barbara Plank. What to do about non-standard (or non-canonical) language in NLP. In KONVENS 2016 [invited]. [pdf] [arXiv] [KONVENS proceedings]
  • Željko Agić, Anders Johannsen, Barbara Plank, Héctor Martínez Alonso, Natalie Schluter and Anders Søgaard. Multilingual Projection for Parsing Truly Low-Resource Languages. In [TACL], 2016.
  • Barbara Plank, Anders Søgaard and Yoav Goldberg. Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss. In ACL (short), 2016. [arXiv, errata]
  • Héctor Martínez Alonso, Anders Johannsen and Barbara Plank. Supersense tagging with inter-annotator disagreement. In LAW-X, ACL workshop, 2016. [pdf]
  • Olga Uryupina, Barbara Plank, Gianni Barlacchi, Francisco J Valverde-Albacete, Manos Tsagkias and Alessandro Moschitti. LiMoSiNe pipeline: Multilingual UIMA-based NLP platform. In ACL demo papers, 2016.
  • Ben Verhoeven, Walter Daelemans and Barbara Plank. TwiSty: a Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling. In LREC 2016. [pdf] [LREC pdf] [technical report]
  • Raffaella Bernardi, Ruket Cakici, Desmond Elliott, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Frank Keller, Adrian Muscat and Barbara Plank. Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures. In JAIR. [arXiv preprint] [JAIR]
  • Aliaksei Severyn, Alessandro Moschitti, Olga Uryupina, Barbara Plank, and Katja Filippova Multi-lingual Opinion Mining on YouTube. In Information Processing Management (IPM) journal, Volume 52, January 2016 [journal, paper]

2015

  • Anders Johannsen, Héctor Martínez Alonso and Barbara Plank. Universal dependencies for Danish. In Treebank and Linguistic Theories (TLT14), 2015. [pdf]
  • Barbara Plank and Dirk Hovy. Personality Traits on Twitter---or---How to Get 1,500 Personality Tests in a Week. In WASSA 2015, EMNLP 2015 workshop. [pdf] [pdf from ACL anthology] [bib] [poster]
  • Barbara Plank, Héctor Martínez Alonso, Željko Agić, Danijela Merkler, Anders Søgaard. Do dependency parsing measures correlate with human judgements? In CoNLL 2015. [pdf] [poster] [data]
  • Thien Huu Nguyen, Barbara Plank and Ralph Grishman. Semantic Representations for Domain Adaptation: A Case Study on the Tree Kernel-based Method for Relation Extraction. In ACL 2015 (long). [pdf] [pdf from ACL anthology] [code]
  • Anders Søgaard, Željko Agić, Héctor Martínez Alonso, Barbara Plank, Bernd Bohnet and Anders Johannsen. Inverted indexing for cross-lingual NLP. In ACL 2015 (long). [pdf] [pdf from ACL anthology]
  • Peter Halkier Nicolajsen and Barbara Plank. Using knowledge components for collaborative filtering in adaptive tutoring systems. In Proceedings of the 8th International Conference on Educational Data Mining, Madrid, 2015. [pdf]
  • Barbara Plank, Hector Martinez Alonso and Anders Søgaard. Non-canonical language is not harder to annotate than canonical language. In the 9th Linguistic Annotation Workshop (LAW IX), NAACL. (Invited) [pdf from ACL anthology]
  • Sarah McGillion, Hector Martinez Alonso and Barbara Plank. CPH: Sentiment analysis of Figurative Language on Twitter #easypeasy #not. In SemEval-2015. [pdf] [pdf from ACL anthology]
  • Hector Martinez Alonso, Barbara Plank, Arne Skjærholt and Anders Søgaard. Learning to parse with IAA-weighted loss. In NAACL 2015. [pdf] [pdf from ACL anthology]
  • Dirk Hovy, Barbara Plank, Hector Martinez Alonso and Anders Søgaard. Mining for unambiguous instances to adapt POS taggers to new domains. In NAACL 2015. [pdf] [pdf from ACL anthology]
  • Hector Martinez Alonso, Barbara Plank, Anders Johannsen and Anders Søgaard. Active learning for sense annotation. In NoDaLiDa 2015. [pdf from ACL anthology]
  • Anders Søgaard, Barbara Plank, Hector Martinez Alonso. Using frame semantics for knowledge extraction from Twitter. In AAAI, 2015. [pdf]

2014

2013

  • Barbara Plank and Alessandro Moschitti. Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria, August 2013. [abstract] [pdf from ACL anthology] [bib]
  • Barbara Plank, Thomas Sauer, Ina Schaefer. Supporting Agile Software Development by Natural Language Processing. In A. Moschitti and B. Plank (Eds.): EternalS 2013, CCIS 379, pp. 91--102. Springer, Heidelberg (2013)

2012

  • Anders Søgaard and Barbara Plank. Parsing the web as covariate shift. In Notes of the First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL). Montreal, Canada. 2012
  • Felice Dell'Orletta, Simone Marchi, Simonetta Montemagni, Barbara Plank, Giulia Venturi. The SPLeT--2012 Shared Task on Dependency Parsing of Legal Texts. In Proceedings of the 4th Workshop on Semantic Processing of Legal Texts 2012, May 2012, Istanbul, Turkey. 2012. [pdf]
  • Barbara Plank and Anders Søgaard. Experiments in newswire-to-law adaptation of graph-based dependency parsers. In: Magnini, B., Cutugno, F., Falcone, M. and Pianta, E. (eds.) Evaluation of Natural Language and Speech Tools for Italian (EVALITA 2011), Lecture Notes in Computer Science, vol. 7689. Springer, Heidelberg (2012). Shorter version published in working notes of EVALITA 2011, 23-24th January 2012, Rome, Italy. 2012.
  • [pdf] Evalita working notes homepage] [bib] [Springer]

2011

2010

  • Barbara Plank and Gertjan van Noord. Dutch Dependency Parser Performance Across Domains. In Proceedings of the 20th Meeting of Computational Linguistics in the Netherlands. [pdf] [LOT occasional series page]
  • Barbara Plank and Gertjan van Noord. Grammar-driven versus Data-driven: Which Parsing System is More Affected by Domain Shifts? In Proceedings of the ACL 2010 Workshop on NLP and Linguistics: Finding the Common Ground, Uppsala, Sweden, July, 2010. [pdf | pdf from ACL Anthology] [slides] [bib]
  • Barbara Plank. Improved statistical measures to assess natural language parser performance across domains. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC2010), Valletta, Malta, May 2010.
    [pdf] [pdf from ELRA] [poster] [bib]

2009

2008

  • Barbara Plank and Khalil Sima'an. Subdomain Sensitive Statistical Parsing using Raw Corpora. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC2008), Marrakech, Morocco, May 2008.
    [pdf] [slides] [bib]
  • Barbara Plank and Gertjan van Noord. Exploring an Auxiliary Distribution based approach to Domain Adaptation of a Syntactic Disambiguation Model. In Proceedings of the Coling 2008 Workshop on Cross-Framework and Cross-Domain Parser Evaluation (PE), pages 9--16, Manchester, United Kingdom, August 2008.
    [pdf] [slides] [bib]
  • Barbara Plank and Khalil Sima'an. Parsing with Subdomain Instance Weighting from Raw Corpora. In Proceedings of INTERSPEECH 2008, Brisbane, Australia, September 2008.
    [pdf] [poster] [bib]

2006

  • Raffaella Bernardi, Diego Calvanese, Luca Dini, Vittorio Di Tomaso, Elisabeth Frasnelli, Ulrike Kugler, Barbara Plank. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC2006), Genova, Italy, 2006.

Tutorials

  • Barbara Plank. No black magic: text processing using the Unix command line [slides]
    The command line interface —invented decades ago, long before the graphical user interface — is an amazing tool for gaining quick insights into data. By combining small, yet powerful utilities you can analyze your data quickly to extract information or create exciting visualizations. In this tutorial, we will provide a hands-on introduction to Unix command line utilities to demystify the “black window”.
    Topics:
    • command line concepts, looking at files (cat,less,head,tail), navigation, searching files (grep and regular expressions)
    • combining commands using the pipe: example of generating frequency lists quickly (less, grep, sed, sort, uniq, cut)
    • brief outlook to more advanced topics: data visualization in R (histograms, scatterplots, bar plots)

    References:
    • Ken Church. Unix™ for Poets
    • Nikolaj Lindberg. http://stts.se/egrep_for_linguists/egrep_for_linguists.pdf
    • Jeroen Janssens. Data Science at the Command Line. O’Reilly. 2014
  • Søgaard, Anders; Plank, Barbara; Hovy, Dirk. 2014. Selection bias, label bias, and bias in ground truth (tutorial). The 25th International Conference on Computational Linguistics (COLING). Dublin, Ireland. [abstract]

Editorial Work

  • Proceedings of Computational Linguistics in the Netherlands 2009: Selected papers from the nineteenth CLIN meeting. Edited by Barbara Plank, Erik Tjong Kim Sang, Tim van der Cruys. LOT Occasional Series 14, Utrecht, 2010. online
  • Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing (DANLP 2010). H. Daume III, T. Deoskar, D. McClosky, B. Plank, J. Tiedemann. bib
  • Proceedings of the 2012 Joint Workshop on Intelligent Methods for Software System Engineering. Stamatia Bibi, Alessandro Moschitti, Barbara Plank, Ioannis Stamelos.

Other

  • Barbara Plank. Linguistic Landscapes in Bozen-Bolzano. Bozen-Bolzano. June, 2006. [slides] (unpublished)

Theses

MSc Thesis

BSc Thesis

  • Barbara Plank. Porting web based applications in mixed environments: An ASP.NET application on the Mono project. Bachelor's thesis, University of Bozen-Bolzano, 2005. Supervisor: Andrea Molinari (University of Trento).
Home