INDEX
Explanations
references to large or oversized objects
references to "giant" entities or characters
New Auto-Interp
Negative Logits
yrinth
-0.91
anwhile
-0.83
qi
-0.77
endment
-0.77
earchers
-0.76
imester
-0.73
eligible
-0.73
rences
-0.72
informed
-0.72
iggins
-0.72
POSITIVE LOGITS
squid
1.03
chunk
0.85
sized
0.83
gorilla
0.81
leap
0.79
monster
0.79
gest
0.78
bould
0.77
chunks
0.76
slug
0.76
Activations Density 0.020%