INDEX
Explanations
mentions of the word "giant" in the text
references to large creatures or entities, particularly those described as "giant."
Negative Logits
yrinth       -0.80
say          -0.69
odore        -0.67
rb           -0.67
treatment    -0.67
schild       -0.67
informed     -0.66
ties         -0.66
qi           -0.65
ntax         -0.65

Positive Logits
squid         1.04
leap          0.87
bould         0.86
leaps         0.85
gorilla       0.83
aster         0.82
sized         0.79
Leap          0.78
Squid         0.78
eyeb          0.77
Activations Density 0.040%
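
As a rough illustration of how a density figure like the one above can be read: the sketch below assumes that "activations density" means the fraction of token positions on which the feature's activation exceeds a threshold. The page does not define it, so that reading is an assumption. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions where the feature fires.

    Assumption: a position counts as "firing" when its activation is
    strictly greater than `threshold`. The source page does not say
    how its density number is computed.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy data: 10,000 token positions, 4 of which activate the feature.
toy = [0.0] * 9996 + [1.3, 0.2, 4.1, 0.7]
density = activation_density(toy)
print(f"{density:.3%}")  # prints "0.040%"
```

Under this reading, a density of 0.040% would mean the feature fires on roughly 4 in every 10,000 tokens, which is consistent with a sparse feature tied to a narrow concept such as "giant".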