INDEX
Explanations
proper nouns, specifically the name "Andre"
occurrences of the name "Andre" in various contexts
New Auto-Interp
Negative Logits
inct
-0.72
ointed
-0.69
ulhu
-0.64
eled
-0.64
clips
-0.61
elled
-0.61
stellar
-0.61
dfx
-0.60
ellen
-0.59
stakes
-0.58
POSITIVE LOGITS
tti
1.16
essen
1.05
byss
0.92
cats
0.78
Drum
0.74
3000
0.74
gie
0.72
iev
0.71
Vu
0.69
Damon
0.69
Activations Density 0.026%