INDEX
Explanations
phrases indicating choices or options involving multiple parties
Negative Logits
Token        Logit
جÙħ          -0.14
Singap       -0.14
æ¤           -0.14
اØŃÙĦ        -0.13
deÅŁ         -0.13
orer         -0.13
]|[          -0.13
ater         -0.13
linkplain    -0.13
braco        -0.12
Positive Logits
Token          Logit
again          0.17
again          0.17
Again          0.15
nown           0.14
Again          0.14
ingleton       0.14
itself         0.14
ëŀĻ            0.14
neider         0.14
permanently    0.13
Activations Density 0.009%