Explanations
references to interpersonal relationships and communication involving 'you' and 'me'
Negative Logits
Ãłm         -0.15
emy         -0.15
zie         -0.15
çĽĸ         -0.14
imin        -0.14
quam        -0.14
uez         -0.14
oka         -0.14
ieux        -0.14
acios       -0.14
Positive Logits
iram        0.16
coni        0.15
788         0.15
bÃŃr        0.15
Blasio      0.14
opause      0.14
éĽĨä¸Ń      0.14
@"";↵       0.14
UPPORTED    0.14
izza        0.13
Activations Density 0.299%