Eastern Michigan University

Damir Cavar

Pray-Harrold

Office 613D

734.487.1148

damir.cavar@emich.edu

Education

PhD (Dr. phil.): 1999/2000, University of Potsdam

Interests and Expertise

Language technology, corpus and computational linguistics.

Courses

LSA Summer Institute Course:

Fall 2013:

  • LING 592 Special Topics: Statistics for Language Study (Fall 2013, THU 6:30-9:10 PM, PRAY-H 618)
  • LING 379 Special Topics: Language Tools (Fall 2013, MO & WED 2-3:15 PM, PRAY-H 521)

Office hours:

  • PRAY-H 613D: MO & WED 1-2 PM, THU 5-6:30 PM
  • Cooper Building (2000 N Huron River Dr., Suite 104): MO-FR 9 AM - 1 PM
  • and by arrangement

Recent Publications and Presentations

Damir Cavar, Helen Aristar-Dry, Anthony Aristar (2012) Large Mailing List Corpora: Management, Annotation and Repository. In LREC 2012 Proceedings of the workshop on Challenges in the management of large corpora.

Damir Cavar, Dunja Brozović Rončević (to appear 2012) Riznica: The Croatian Language Corpus. In Prace Filologiczne, Warsaw.

Damir Cavar, Melanie Seiss (2011) Clitic Placement, Syntactic Discontinuity, and Information Structure. In LFG Proceedings 2011. ISSN 1098-6782

Damir Cavar, Tanja Gulan, Damir Kero, Franjo Pehar, Pavle Valerjev (2011) The Scheme Natural Language Toolkit (SNLTK): NLP libraries for R6RS and Racket. In Proceedings of the 4th European Lisp Symposium, Hamburg University of Technology, pp. 58-61.

Damir Cavar (2010) On Statistical Metrics for Selection and Phrasality. In T. Hanneforth and G. Fanselow (eds.) Language and Logos. Akademie Verlag, Berlin. ISBN 978-3050049311

Damir Cavar, Ivo-Pavao Jazbec, Siniša Runjaić (2009) Efficient Morphological Parsing with a Weighted Finite State Transducer. Informatica 33/1, pp. 107-113. ISSN 0350-5596

 

Presentations:

Bootstrapping large text corpora with TEI XML markup and linguistic annotation. Together with Malgorzata E. Cavar. Chicago Colloquium on Digital Humanities and Computer Science. The University of Chicago. 19th of November 2012.

Automatic Linguistic Annotation with TEI-Output. TEI 2012 Conference at Texas A&M, College Station. 9th of November 2012.

The LINGUIST List Corpus: A Large Mailing List Corpus – Management, Annotation and Repository. Together with Malgorzata E. Cavar, Helen Aristar-Dry, and Anthony Aristar. TEI 2012 Conference at Texas A&M, College Station. 9th of November 2012.

The Project Gutenberg book archive as a TEI P5 XML text corpus. Together with Malgorzata E. Cavar. TEI 2012 Conference at Texas A&M, College Station. 10th of November 2012.

Dynamic Professional Content Corpora and New Technologies. Wayne State University, 19th of October 2012.

Large Mailing List Corpora: Management, Annotation and Repository. Together with Helen Aristar-Dry and Anthony Aristar. LREC 2012 Workshop on Challenges in the management of large corpora, Istanbul, 22nd of May, 2012.

Sprachtechnologie und Sprachdokumentation: Eine Darstellung von Projekten des Heimatinstituts der LINGUIST List. Institut für Deutsche Sprache, Mannheim. 8th of May, 2012.

Bootstrapping NLP and MT Resources for under-resourced languages. Crosslingual Language Technology in service of an integrated multilingual Europe - 20 years on. University of Hamburg, Germany, 4-5 May 2012.

On Split Islands. Presented at the Syntax/Semantics Discussion Group Meeting in the Linguistics Department at the University of Michigan. 20th of Jan. 2012.

Cyclicity and Opacity Effects in the Prosody of Two Different Clitic Classes in New Shtokavian Variants. Together with Malgorzata E. Cavar. Presented at the Ilse Lehiste Memorial Symposium. 11th of November 2011.

Clitic Placement, Syntactic Discontinuity, and Information Structure. With Melanie Seiß. LFG 2011, Hong Kong. 17th of July 2011.

The Scheme Natural Language Toolkit (S-NLTK): NLP Library for R6RS and Racket. 4th European Lisp Symposium, Special Focus on Parallelism & Efficiency, TUHH, Hamburg University of Technology, Hamburg, Germany, 1st of April 2011.