Explanations
references to the term "King"
references to the word "King."
Negative Logits
Token        Logit
eria         -0.81
TING         -0.78
ters         -0.77
uated        -0.73
ATIONAL      -0.73
ciplinary    -0.72
ename        -0.71
Ö¼           -0.70
ission       -0.69
arial        -0.68
Positive Logits
Token        Logit
pin          1.26
uin          1.15
doms         1.13
pins         1.11
dom          1.10
fish         1.01
DOM          0.97
Kong         0.94
stown        0.90
Clancy       0.89
Activations Density 0.014%