INDEX
Explanations
discussions about personal growth and change
Negative Logits
ibo      -0.15
ungan    -0.15
arga     -0.15
utin     -0.14
literal  -0.14
दम       -0.14
oton     -0.14
ανδ      -0.13
ton      -0.13
Gregg    -0.13
POSITIVE LOGITS
Exactly  0.17
604      0.16
elay     0.16
exactly  0.16
AA       0.15
404      0.15
lemn     0.15
ilty     0.14
elden    0.14
xp       0.14
Activations Density 0.146%