INDEX
Explanations
references to personal feelings and relationships
New Auto-Interp
Negative Logits
obec
-0.16
KeyId
-0.16
exactly
-0.15
obi
-0.15
op
-0.15
santé
-0.15
isis
-0.14
ilent
-0.14
lund
-0.14
indeed
-0.14
POSITIVE LOGITS
azo
0.17
succ
0.17
sometimes
0.16
anton
0.15
tti
0.15
lein
0.14
hus
0.14
enef
0.14
UILD
0.14
het
0.14
Activations Density 0.104%