%PDF-1.6
%
1 0 obj
<>
endobj
2 0 obj
<>
endobj
3 0 obj
<>stream
IEEE
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2020; ; ;
Vision and spoken language
semantic embedding space
self-attention
and cross-lingual retrieval
Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms
Yasunori Ohishi
Akisato Kimura
Takahito Kawanishi
Kunio Kashino
David Harwath
James Glass
endstream
endobj
4 0 obj
<>stream
x+ |
endstream
endobj
5 0 obj
<>stream
xN@D)AB8 KeEJk5ldߚ53iOz?JGw*CUef3ۇF39qHĹbM|qLbWh?]Bg$x0hwζy28*WE-fzBuK۶8NF.WNd!_Op-p/zӎ ?
endstream
endobj
6 0 obj
<>stream
x+ |
endstream
endobj
7 0 obj
<>stream
x=O@DS# 2"5ٷn
_MF4'=x=J{7*]U4ujػF3F9qHĹLbM[|qLb ?}Bo.$x0jζy6^97ܪMqmS1DuE9h<^
\>stream
x+ |
endstream
endobj
9 0 obj
<>stream
xN@D)AB8 KeEJk5ldߚ53iOz?JGw*CUef3ۇF39qHĹbM|qLbWh?]Bg$x0hwζy28*WE-fzBuK۶8NF.WNd!_Op-p/zӎ `?
endstream
endobj
10 0 obj
<>stream
x+ |
endstream
endobj
11 0 obj
<>stream
xN@D)AB8 KeEJk5ldߚ53iOz?JGw*CUef3ۇF39qHĹbM|qLbWh?]Bg$x0hwζy28*WE-fzBuK۶8NF.WNd!_Op-p/zӎ ?
endstream
endobj
12 0 obj
<>stream
x+ |
endstream
endobj
13 0 obj
<>stream
xN@D)AB8 KeEJk5ldߚ53iOz?#|OK;P*tC8s\F̦
A8&+.3ld
>>stream
HW{Tu} [v}RuE%G&!$Y"S^-v'37d$ęNmY*jB-](v]ԮֵVףg{u{'?߽}~3M&B5mQ8!}~Tb"j:XL
3Q))Te9P#sqC0*XY# ꀏsG&stH