INDEX
Explanations
the mention of individuals named David
Negative Logits
titious   -0.67
ràng      -0.66
Ohl       -0.66
Besch     -0.63
*~*~      -0.63
Hör       -0.62
../       -0.62
Lugo      -0.61
Butyl     -0.61
Lleg      -0.60
Positive Logits
DAVID     1.07
Davids    1.04
bridges   1.03
Bridges   1.03
Meksiku   1.02
David     1.02
Bowie     1.00
David     0.99
DAVID     0.98
bridges   0.98
Activations Density 0.073%