INDEX
Explanations
proper nouns or names (e.g., Susie, Susana, Suzette) in a text
references to a specific character or name related to suspicion or scrutiny
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.94
akings
-0.68
Doomsday
-0.65
sorting
-0.64
dividing
-0.64
overhead
-0.64
separating
-0.63
stakes
-0.63
Kinnikuman
-0.63
living
-0.62
POSITIVE LOGITS
pected
1.33
pect
1.30
pects
1.23
pecting
1.21
pic
1.21
annah
1.14
pir
1.14
cept
1.01
cription
0.91
sex
0.91
Activations Density 0.022%