INDEX
Explanations
the name "Amanda" in various contexts
occurrences of the name "Amanda."
New Auto-Interp
Negative Logits
direction
-0.77
vous
-0.73
safety
-0.72
book
-0.71
roups
-0.70
lag
-0.69
shadow
-0.69
perse
-0.67
grading
-0.67
istry
-0.67
POSITIVE LOGITS
Berry
1.05
Amanda
1.00
Knox
0.94
Matthews
0.90
Todd
0.86
gdala
0.80
Carter
0.80
Nunes
0.80
Joyce
0.79
Bradley
0.79
Activations Density 0.010%