INDEX
Explanations
mentions of a specific name "Mahdi"
references to the word "diabetes."
New Auto-Interp
Negative Logits
tty
-0.69
ELY
-0.67
tails
-0.66
orship
-0.66
olves
-0.64
UT
-0.63
Rapids
-0.63
Franchise
-0.62
Loud
-0.61
heads
-0.61
POSITIVE LOGITS
abetes
1.00
plom
0.97
ãĥĻ
0.91
urnal
0.88
DragonMagazine
0.83
pling
0.81
ploma
0.80
PsyNetMessage
0.79
agonal
0.78
hammad
0.76
Activations Density 0.031%