Explanations
expressions of hope and optimism
Negative Logits
Token    Logit
ubat     -0.17
cono     -0.15
хран     -0.15
olders   -0.15
.rdf     -0.15
abile    -0.15
bett     -0.15
ียว      -0.14
osaic    -0.14
ΕΙ       -0.14
Positive Logits
Token    Logit
hope     0.22
hopes    0.20
hope     0.18
lessly   0.18
Hope     0.17
Hope     0.17
quip     0.16
ilos     0.16
hop      0.16
Trom     0.15
Activations Density 0.042%