INDEX
Explanations
references to the word "ginger"
references to ginger
New Auto-Interp
Negative Logits
igslist
-0.79
ĵĺ
-0.75
ingu
-0.71
isites
-0.71
iries
-0.67
oln
-0.65
Colleges
-0.62
leased
-0.61
Educational
-0.61
loopholes
-0.60
POSITIVE LOGITS
bread
1.63
ginger
1.07
ale
0.91
Ginger
0.91
bats
0.91
lings
0.87
bum
0.84
grass
0.84
weed
0.83
stones
0.82
Activations Density 0.008%