INDEX
Explanations
phrases related to reports and statements made in various contexts
Negative Logits
imilar      -0.69
ogenic      -0.67
ãĥİ         -0.66
estern      -0.64
sickness    -0.63
icio        -0.61
Birthday    -0.61
ãĥĩ         -0.56
animate     -0.56
stump       -0.55
Positive Logits
quoting      0.81
adding       0.74
bluntly      0.72
citing       0.70
lege         0.65
.            0.64
omin         0.64
cris         0.63
referring    0.63
noting       0.61
Activations Density 0.091%
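
The density figure is usually read as the fraction of sampled tokens on which the feature fires at all. The sketch below shows one way such a number could be computed, assuming a feature counts as active when its activation is strictly above a threshold of zero. The function name `activation_density`, the threshold, and the sample data are hypothetical and are not taken from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "active" means strictly greater than the threshold;
    the dashboard's exact definition is not stated in the source.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 9 of which activate the feature.
sample = [0.0] * 10_000
for i in range(9):
    sample[i * 1000] = 1.5

density = activation_density(sample)
print(f"{density:.3%}")  # formatted as a percentage, like the dashboard value
```

A density this low would mean the feature is sparse: it fires on roughly one token in a thousand.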