INDEX
Explanations
the name "Alex" with varying levels of relevance
occurrences of the name "Alex."
New Auto-Interp
Negative Logits
fare
-0.72
stakes
-0.68
recy
-0.67
final
-0.66
unity
-0.66
purpose
-0.66
liness
-0.65
primary
-0.64
enegger
-0.64
draft
-0.63
POSITIVE LOGITS
inia
0.93
iev
0.82
opoulos
0.82
ei
0.81
andra
0.79
orean
0.79
anian
0.78
ulia
0.78
Anton
0.78
illo
0.78
Activations Density 0.010%