INDEX
Explanations
references to British royalty, specifically Queen Victoria and her family associations
historical figures and street names
New Auto-Interp
Negative Logits
المشاركات
-0.66
featureID
-0.63
########.
-0.60
-0.58
AndEndTag
-0.57
surate
-0.54
orsese
-0.54
Билгалдахарш
-0.54
يكب
-0.53
nobyl
-0.53
POSITIVE LOGITS
Autoritní
0.32
nonUne
0.32
tst
0.30
Street
0.29
Crescent
0.29
seseorang
0.28
0.28
crescent
0.27
("")]
0.26
oef
0.26
Activations Density 0.041%