Accepted Papers

Full papers:
  • Rodrigo Chamun, Daniele Pinheiro, Diego Jornada, João Oliveira and Isabel Manssour. Extracting web content for personalized presentation
  • Boris Dadachev, Alexander Balinsky and Helen Balinsky. On Automatic Text Segmentation
  • Markus Diem, Florian Kleber and Robert Sablatnig. Ruling Analysis and Classification of Torn Documents
  • Hannes Dohrn and Dirk Riehle. Fine-grained Change Detection in Structured Text Documents
  • Klaas Andries de Graaf, Peng Liang, Antony Tang and Hans van Vliet. The Impact of Prior Knowledge on Searching in Software Documentation
  • Joshua Hailpern, Niranjan Damera-Venkata and Marina Danilevsky. Pagination: It's what you say, not how long it takes to say it
  • Joshua Hailpern, Niranjan Damera-Venkata and Marina Danilevsky. Truncation: All the News that Fits We'll Print
  • Rinaldo Lima, Jamilson Batista, Rafael Ferreira, Fred Freitas, Rafael Lins, Steven Simske and Marcelo Riss. Simplifying Graph-based Representations of Sentences for Relation Extraction
  • Wei Liu, Wei Fan, Jun Sun and Naoi Satoshi. Paper Stitching using Maximum Tolerant Seam under Local Distortions
  • Diogo Martins and Maria Da Graça Pimentel. ActiveTimesheets: extending Web-based multimedia documents with dynamic modification and reuse features
  • Rafael Mello, Rafael Lins, Fred Freitas, Steven J. Simske, Bruno Avila, Rodolfo Ferreira and Marcelo Riss. A New Sentence Similarity Assessment Measure based on a Three-Layer Sentence Representation
  • Seyednaser Nourashrafeddin, Evangelos Milios and Dirk Arnold. An Ensemble Approach for Text Document Clustering using Wikipedia Concepts
  • Patrick Schmitz and Laurie Pearce. Humanist-centric tools for Big Data: Berkeley Prosopography Services
  • Alan Souza, Viviane Moreira and Carlos Heuser. ARTIC: Metadata Extraction from Scientific Papers in PDF using Two-Layer CRF
  • Kyle Williams, Hung-Hsuan Chen and C. Lee Giles. Classifying and Ranking Search Engine Results as Potential Sources of Plagiarism

Short papers, application notes and posters:
  • John Barrus and Edward Schwartz. Image-Based Document Management: Aggregating Collections of Handwritten Forms
  • Marti Bosch, Pierre Geneves and Nabil Layaida. Automated CSS Optimization by Logical Reasoning
  • Luciano Cabral, Rafael Lins, Rafael Mello, Fred Freitas, Bruno Avila, Steven Simske and Marcelo Riss. A Platform for Language Independent Summarization
  • Owen Davison, Abidalrahman Mohammad and Evangelos Milios. P-GTM: Privacy-Preserving Google Tri-gram Method for Semantic Text Similarity
  • Laurent Denoue, Matthew Cooper, Andreas Girgensohn and Scott Carter. Building digital project rooms for web meetings
  • Stefano Ferilli, Domenico Grieco, Domenico Redavid and Floriana Esposito. Abstract Argumentation for Reading Order Detection
  • Juliane Franze, Kim Marriott and Michael Wybrow. What Academics Want When Reading Digitally
  • Michael Gormish and John Barrus. Connecting Content and Annotations with LiveStroke
  • Jacob Hughes, David Brailsford, Steven Bagley and Clive Adams. Generating Summary Documents from a Variable-Quality PDF Document Collection
  • Mariana Kolberg, Luiz Fernandes, Mateus Raeder and Carolina Fonseca. JAR Tool: Using Document Analysis for Improving the Throughput of High Performance Printing Environments
  • Wanqiong Pan, Zhouhui Lian, Rongju Sun, Yingmin Tang and Jianguo Xiao. FlexiFont: A Flexible System to Generate Personal Font Libraries
  • Mira Sarkis, Cyril Concolato and Jean-Claude Dufourd. The Virtual Splitter: Refactoring Web Applications for the Multiscreen Environment
  • Robert Ulichney, Matthew Gaubatz and Steven Simske. Circular Coding with Interleaving Phase
  • Kyle Williams, Jian Wu and C. Lee Giles. SimSeeX: A Similar Document Search Engine