INDEX
Explanations
references to 'boy' in various contexts
references to "boy" and "boys."
Negative Logits
aeda          -0.78
mble          -0.78
cumbers       -0.78
conflic       -0.75
¥ŀ            -0.73
plurality     -0.72
millenn       -0.72
srf           -0.72
subsistence   -0.71
undermin      -0.70
Positive Logits
boy           1.63
friend        1.37
boys          1.29
hood          1.14
girl          1.07
hole          1.06
cott          0.99
pool          0.95
holes         0.95
bags          0.95
Activations Density: 0.005%