Explanations
the word "boys" with varying activations, emphasizing a focus on this specific term
occurrences of the word "boys."
Negative Logits
Token                Logit
mediated             -0.73
uncture              -0.71
Accessory            -0.69
Sharp                -0.69
aeda                 -0.68
ãĥķãĤ©               -0.67
osi                  -0.66
ascular              -0.66
inventoryQuantity    -0.66
itures               -0.66
Positive Logits
Token                Logit
boys                 0.97
Scouts               0.97
friend               0.85
hood                 0.83
boys                 0.81
hift                 0.79
puberty              0.78
ages                 0.77
Boys                 0.77
girls                0.76
Activations Density: 0.013%