INDEX
Explanations
sentence endings or punctuation marks indicating finality
New Auto-Interp
Negative Logits
kick
-0.18
ruk
-0.16
(
-0.16
erie
-0.16
taken
-0.15
pt
-0.15
é§
-0.15
ms
-0.15
taken
-0.15
lich
-0.15
POSITIVE LOGITS
çħ§
0.17
åĽ
0.16
ylland
0.15
/frontend
0.15
FunctionFlags
0.15
uegos
0.15
iminal
0.15
Intialized
0.14
serg
0.14
CallCheck
0.14
Activations Density 0.003%