The Ubuntu Chat Corpus for Multiparticipant Chat Analysis

reportActive / Technical Report | Accession Number: ADA602658 | Open PDF

Abstract:

We present the Ubuntu Chat Corpus as a data source for multiparticipant chat analysis. This addresses the problem of the lack of a large, publicly suitable corpora for research in this medium. The advantages of using this corpus for research is its large number of chat messages its multiple languages, its technical nature, and all of the original chat messages are in the public domain.

Security Markings

DOCUMENT & CONTEXTUAL SUMMARY

Distribution:
Approved For Public Release
Distribution Statement:
Approved For Public Release; Distribution Is Unlimited.

RECORD

Collection: TR
Identifying Numbers
Subject Terms