INDEX
Explanations
questions and statements about beliefs or opinions
Question-like utterances or statements of disagreement
question tags and confirmations
New Auto-Interp
Negative Logits
termica
-0.52
cucchia
-0.49
communiquez
-0.46
Dott
-0.45
.}\
-0.44
Siti
-0.43
Outback
-0.43
zcze
-0.43
ù
-0.43
contratto
-0.43
POSITIVE LOGITS
ymce
0.77
grees
0.69
XCTest
0.68
ocry
0.67
phors
0.65
uchtet
0.65
estanden
0.65
$_['
0.64
graphique
0.64
ínű
0.63
Activations Density 0.052%