Reducing Excessive Amounts of Data : Multiple Web Queries for Generation of Pun Candidates

Joint Authors

Dybala, Pawel
Sayama, Kohichi
Ptaszynski, Michal

Source

Advances in Artificial Intelligence

Issue

Vol. 2011, Issue 2011 (31 Dec. 2011), pp.1-12, 12 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2012-02-29

Country of Publication

Egypt

No. of Pages

12

Main Subjects

Information Technology and Computer Science
Science

Abstract EN

Humor processing is still a less studied issue, both in NLP and AI.

In this paper we contribute to this field.

In our previous research we showed that adding a simple pun generator to a chatterbot can significantly improve its performance.

The pun generator we used generated only puns based on words (not phrases).

In this paper we introduce the next stage of the system's development—an algorithm allowing generation of phrasal pun candidates.

We show that by using only the Internet (without any hand-made humor-oriented lexicons), it is possible to generate puns based on complex phrases.

As the output list is often excessively long, we also propose a method for reducing the number of candidates by comparing two web-query-based rankings.

The evaluation experiment showed that the system achieved an accuracy of 72.5% for finding proper candidates in general, and the reduction method allowed us to significantly shorten the candidates list.

The parameters of the reduction algorithm are variable, so that the balance between the number of candidates and the quality of output can be manipulated according to needs.

American Psychological Association (APA)

Dybala, Pawel& Ptaszynski, Michal& Sayama, Kohichi. 2012. Reducing Excessive Amounts of Data : Multiple Web Queries for Generation of Pun Candidates. Advances in Artificial Intelligence،Vol. 2011, no. 2011, pp.1-12.
https://search.emarefa.net/detail/BIM-446962

Modern Language Association (MLA)

Dybala, Pawel…[et al.]. Reducing Excessive Amounts of Data : Multiple Web Queries for Generation of Pun Candidates. Advances in Artificial Intelligence No. 2011 (2011), pp.1-12.
https://search.emarefa.net/detail/BIM-446962

American Medical Association (AMA)

Dybala, Pawel& Ptaszynski, Michal& Sayama, Kohichi. Reducing Excessive Amounts of Data : Multiple Web Queries for Generation of Pun Candidates. Advances in Artificial Intelligence. 2012. Vol. 2011, no. 2011, pp.1-12.
https://search.emarefa.net/detail/BIM-446962

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-446962