INDEX
Explanations
references to the word "giant" along with other context-specific terms
mentions of "giant" and its variations in contexts related to scale or size
New Auto-Interp
Negative Logits
say
-0.70
nikov
-0.66
nces
-0.65
ggies
-0.65
ople
-0.63
rals
-0.63
ntax
-0.62
akers
-0.62
cause
-0.61
rina
-0.61
POSITIVE LOGITS
squid
1.16
Squid
0.89
bould
0.85
leap
0.83
Panda
0.80
ess
0.80
ape
0.78
leaps
0.78
strides
0.77
Slayer
0.76
Activations Density 0.063%