What is Humtext?
Humtext. Corpus of written humor is an online corpus specifically designed for the study of written humor in printed publications, such as newspapers, brochures, magazines, almanacs, books, booklets, or fanzines.This corpus complements the multimodal database Humcor, which collects over 120 years of oral humorous production from different varieties of Spanish from Spain, Latin America, and Equatorial Guinea.
What does Humtext include?
It constitutes a database of humorous texts — jokes, anecdotes, epigrams, obituaries, epitaphs, short stories, articles, chronicles, or news pieces — also drawn from different formats, such as cartoons, comic strips, among others.
What time period does Humtext cover?
The corpus spans a broad historical period, with materials dating from 1495 to 2025—that is, over 500 years of written humorous production. The texts have been gathered from online historical archives, personal collections, libraries, and digital platforms.
What is the purpose of Humtext?
The project aims to systematically and extensively study and document written humor in Spanish as it appears in printed texts over more than five centuries of history. It also includes the digitization of materials from the written texts, with the goal of creating a digital library dedicated to written humor.