Explanations
symbols or special characters indicating lists
bullet points or list indicators
Negative Logits
ierre     -0.78
graz      -0.77
erer      -0.75
othal     -0.75
udic      -0.71
Tanz      -0.67
nuts      -0.67
enthal    -0.64
resent    -0.63
bris      -0.63
Positive Logits
··        1.14
••        0.89
·         0.85
sim       0.83
¼         0.82
••••      0.82
NET       0.78
Pg        0.76
Reason    0.74
¾         0.74
Activations Density 0.005%
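
Token strings in logit lists like the ones above are often shown in the tokenizer's byte-level alphabet, where a bullet character appears as `âĢ¢`. The sketch below shows one way to map such a string back to readable text. It assumes the underlying tokenizer uses the GPT-2-style byte-level BPE byte-to-character table, which this page does not state; `bytes_to_unicode` and `decode_byte_level` are illustrative names, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Byte -> printable character table, assuming GPT-2-style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    keep = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {b: chr(b) for b in keep}
    # Every remaining byte is assigned the next code point from 256 upward.
    shift = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + shift)
            shift += 1
    return table


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {c: b for b, c in bytes_to_unicode().items()}


def decode_byte_level(token: str) -> str:
    """Turn a byte-level token string back into text (hypothetical helper).

    Raises KeyError if the string contains characters outside the
    byte-level alphabet, i.e. if it was already decoded.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # A token may hold only part of a multi-byte character, so invalid
    # sequences are replaced rather than raising an error.
    return raw.decode("utf-8", errors="replace")


print(decode_byte_level("âĢ¢"))  # prints the bullet character •
```

Tokens that consist of a single high byte, such as a lone `·`, may be fragments of a longer UTF-8 sequence, so they will not necessarily decode to a standalone character under this assumption.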