INDEX
Explanations
references to a person named Andre
the name "Andre" in various contexts
New Auto-Interp
Negative Logits
inct
-1.00
ulhu
-0.80
ead
-0.75
stakes
-0.70
dfx
-0.69
lished
-0.68
lishing
-0.68
plain
-0.66
ointed
-0.65
clips
-0.65
POSITIVE LOGITS
tti
1.14
essen
0.95
byss
0.87
cats
0.81
XIII
0.78
Paste
0.78
Vu
0.75
Damon
0.74
Andre
0.73
Rus
0.70
Activations Density 0.031%