INDEX
Explanations
mentions of important or influential individuals
titles or roles of prominent individuals, specifically those in leadership or professional positions
New Auto-Interp
Negative Logits
edIn
-0.73
ilver
-0.70
Members
-0.69
ourcing
-0.67
Bound
-0.66
english
-0.64
Length
-0.64
etimes
-0.63
اÙĦ
-0.63
Definitions
-0.61
POSITIVE LOGITS
himself
0.86
's
0.78
osphere
0.77
iest
0.75
liest
0.70
opted
0.69
stown
0.68
laureate
0.68
whom
0.68
owicz
0.68
Activations Density 0.294%