INDEX
Explanations
references to individuals named Alex
mentions of the name "Alex."
New Auto-Interp
Negative Logits
phone
-0.67
staff
-0.65
discouraging
-0.65
Demand
-0.65
purpose
-0.65
tips
-0.63
office
-0.63
final
-0.63
student
-0.63
liness
-0.62
POSITIVE LOGITS
Anton
0.92
Alex
0.92
iev
0.86
inia
0.84
anian
0.83
iants
0.83
Koz
0.80
illo
0.79
Alexander
0.78
andra
0.77
Activations Density 0.008%