INDEX
Explanations
the word "little" preceded by another word
occurrences of the word "little."
Negative Logits
Token      Logit
itivity    -0.81
igham      -0.77
CoC        -0.72
rite       -0.70
anwhile    -0.66
zanne      -0.66
idon       -0.65
rette      -0.64
eneg       -0.64
idents     -0.64
Positive Logits
Token      Logit
bit        0.85
kid        0.68
thing      0.67
legged     0.66
tid        0.66
patience   0.65
girls      0.65
avail      0.64
finger     0.64
girl       0.64
Activations Density 0.025%
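
As a rough illustration of what a density figure like this usually means, the sketch below computes the fraction of tokens on which a feature has a nonzero activation. It assumes density is defined as "tokens where the feature fires, divided by all tokens sampled"; the page itself does not state the definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. This is an illustrative definition, not one
    taken from the dashboard.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 4,000 tokens, with the feature firing on exactly one.
acts = [0.0] * 3999 + [2.3]
density = activation_density(acts)

# Print the density as a percentage with three decimal places.
print(f"{density:.3%}")
```

Under this assumed definition, one firing token in 4,000 gives the same 0.025% figure reported above, which conveys how sparse the feature is.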