ログイン
Language:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. A500 情報学部/情報学研究科・情報文化学部・情報科学研究科
  2. A500e 会議資料
  3. 国際会議

Nonword-to-Image Generation Considering Perceptual Association of Phonetically Similar Words

http://hdl.handle.net/2237/0002010850
http://hdl.handle.net/2237/0002010850
f660a8cd-8e4f-471e-8d99-6822e90ce7c7
名前 / ファイル ライセンス アクション
matsuhirac_ACMMM2023_McGE.pdf matsuhirac_ACMMM2023_McGE.pdf (8.7 MB)
アイテムタイプ itemtype_ver1(1)
公開日 2024-05-23
タイトル
タイトル Nonword-to-Image Generation Considering Perceptual Association of Phonetically Similar Words
言語 en
著者 Matsuhira, Chihaya

× Matsuhira, Chihaya

en Matsuhira, Chihaya

Search repository
Kastner, Marc A.

× Kastner, Marc A.

en Kastner, Marc A.

Search repository
Komamizu, Takahiro

× Komamizu, Takahiro

en Komamizu, Takahiro

Search repository
Hirayama, Takatsugu

× Hirayama, Takatsugu

en Hirayama, Takatsugu

Search repository
Doman, Keisuke

× Doman, Keisuke

en Doman, Keisuke

Search repository
Ide, Ichiro

× Ide, Ichiro

en Ide, Ichiro

Search repository
アクセス権
アクセス権 open access
アクセス権URI http://purl.org/coar/access_right/c_abf2
内容記述
内容記述タイプ Abstract
内容記述 Text-to-Image (T2I) generation has long been a popular field of multimedia processing. Recent advances in large-scale vision and language pretraining have brought a number of models capable of very high-quality T2I generation. However, they are reported to generate unexpected images when users input words that have no definition within a language (nonwords), including coined words and pseudo-words. To make the behavior of T2I generation models against nonwords more intuitive, we propose a method that considers phonetic information of text inputs. The phonetic similarity is adopted so that the generated images from a nonword contain the concept of its phonetically similar words. This is based on the psycholinguistic finding that humans would also associate nonwords with their phonetically similar words when they perceive the sound. Our evaluations confirm a better agreement of the generated images of the proposed method with both phonetic relationships and human expectations than a conventional T2I generation model. The cross-lingual comparison of generated images for a nonword highlights the differences in language-specific nonword-imagery correspondences. These results provide insight into the usefulness of the proposed method in brand naming and language learning.
言語 en
内容記述
内容記述タイプ Other
内容記述 MM '23: The 31st ACM International Conference on Multimedia Ottawa ON Canada 29 October 2023
言語 en
出版者
出版者 Association for Computing Machinery
言語 en
言語
言語 eng
資源タイプ
資源タイプresource http://purl.org/coar/resource_type/c_5794
タイプ conference paper
出版タイプ
出版タイプ AM
出版タイプResource http://purl.org/coar/version/c_ab4af688f83e57aa
関連情報
関連タイプ isVersionOf
識別子タイプ DOI
関連識別子 https://doi.org/10.1145/3607541.3616818
関連情報
関連タイプ isPartOf
識別子タイプ ISBN
関連識別子 979-8-4007-0278-5
書誌情報 en : McGE '23: Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice

p. 115-125, 発行日 2023-10
戻る
0
views
See details
Views

Versions

Ver.1 2024-05-23 05:43:08.435966
Show All versions

Share

Share
tweet

Cite as

Other

print

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR 2.0
  • OAI-PMH JPCOAR 1.0
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX
  • ZIP

コミュニティ

確認

確認

確認


Powered by WEKO3


Powered by WEKO3