INDEX
Explanations
relationships and conflicts involving family dynamics
New Auto-Interp
Negative Logits
avis
-0.15
едаг
-0.14
ãĥ³ãĥķ
-0.14
zag
-0.14
obic
-0.13
uggy
-0.13
oki
-0.13
antis
-0.13
šk
-0.13
arring
-0.13
POSITIVE LOGITS
ÑĤоже
0.68
also
0.64
too
0.63
also
0.54
too
0.54
ebenfalls
0.51
också
0.50
ALSO
0.49
également
0.49
ä¹Łæĺ¯
0.48
Activations Density 1.189%