INDEX
Explanations
instances of the word 'little'
instances of the word "little."
New Auto-Interp
Negative Logits
eneg
-0.93
osponsors
-0.78
ovies
-0.78
itivity
-0.76
idents
-0.76
atars
-0.76
apons
-0.76
iership
-0.75
ocity
-0.74
yrim
-0.74
POSITIVE LOGITS
bit
1.42
peek
0.85
glimpse
0.84
BIT
0.82
rusty
0.79
tad
0.77
girl
0.76
chunk
0.75
patience
0.75
extra
0.73
Activations Density 0.027%