INDEX
Explanations
mentions of health conditions, particularly malignancies
terms related to malfunctions or negative conditions
New Auto-Interp
Negative Logits
BOOK
-0.82
æĸ¹
-0.74
FACE
-0.72
ITED
-0.72
DragonMagazine
-0.71
Hobby
-0.71
zzo
-0.70
Carbuncle
-0.69
hetti
-0.69
Solitaire
-0.68
POSITIVE LOGITS
mal
1.15
vulner
1.00
adies
0.96
colm
0.91
ignant
0.88
ciating
0.79
mal
0.79
challeng
0.79
practice
0.78
querade
0.78
Activations Density 0.008%