INDEX
Explanations
mentions of the name "Hassan."
New Auto-Interp
Negative Logits
idge
-0.74
lying
-0.72
mented
-0.71
LOAD
-0.70
çĦ
-0.69
FH
-0.66
ground
-0.66
tted
-0.65
amina
-0.64
chest
-0.64
POSITIVE LOGITS
Rouhani
1.15
Whites
0.99
Hassan
0.92
atos
0.87
etics
0.85
Nas
0.84
ascus
0.82
anth
0.81
daq
0.80
xual
0.80
Activations Density 0.004%