INDEX
Explanations
statements attributed to speakers in a narrative context
New Auto-Interp
Negative Logits
antan
-0.15
aram
-0.15
acia
-0.15
èº
-0.15
á»ĵm
-0.15
alis
-0.14
hus
-0.14
idlo
-0.14
.mas
-0.14
elop
-0.13
POSITIVE LOGITS
reporters
0.20
ÙĴس
0.18
us
0.18
ousel
0.16
yb
0.15
pragma
0.15
OUNDS
0.15
usb
0.14
à¤Ĥà¤Ĺ
0.14
esson
0.14
Activations Density 0.035%