INDEX
Explanations
phrases related to guidance or advice
after "style" or "being"
New Auto-Interp
Negative Logits
ٌ
-0.55
ize
-0.38
main
-0.38
near
-0.38
i
-0.37
off
-0.37
[
-0.37
-0.36
räume
-0.36
itet
-0.36
POSITIVE LOGITS
Hochspringen
0.86
Roskov
0.80
}}"></
0.78
nakalista
0.76
']))
0.76
ftagPool
0.76
}")
0.75
$}}
0.74
ComVisible
0.74
utafitiHapana
0.74
Activations Density 0.047%