Explanations
instances of hypocrisy or contradictions in behavior and beliefs
Negative Logits
Token            Logit
StructEnd        -0.67
препратки        -0.67
autorytatywna    -0.66
Expedia          -0.66
isome            -0.64
providedIn       -0.63
Referanser       -0.62
CppMethod        -0.62
Jeografia        -0.60
########.        -0.60
Positive Logits
Token            Logit
own              0.90
myself           0.81
sendiri          0.73
自分も           0.70
Own              0.70
próprio          0.69
propia           0.69
eigenes          0.68
himself          0.64
personally       0.64
Activations Density: 0.288%