INDEX
Explanations
mentions of the name "Bilal" in various contexts
mentions of the name "Bil" in various contexts
New Auto-Interp
Negative Logits
deviation
-0.66
STATE
-0.64
Founding
-0.63
FAM
-0.63
EAR
-0.62
ULT
-0.61
FISA
-0.61
WAR
-0.61
Dangerous
-0.61
é¾įå¥ij士
-0.60
POSITIVE LOGITS
gewater
1.18
Bil
1.06
boa
0.95
ittle
0.95
uto
0.93
una
0.91
adin
0.89
atars
0.88
atar
0.87
quet
0.86
Activations Density 0.008%