INDEX
Explanations
expressions of worry or concern
Negative Logits
readcr    -0.15
eller     -0.15
hoe       -0.15
Ñĸж       -0.15
Colony    -0.14
agas      -0.14
idan      -0.14
ARK       -0.14
laut      -0.14
sass      -0.14
Positive Logits
ãĥŃãĥ¼    0.17
judgement 0.16
sust      0.15
Pierce    0.15
judgment  0.15
uto       0.15
NP        0.15
é¤        0.14
Swinger   0.14
Teh       0.14
Activations Density: 0.226%