INDEX
Explanations
names of people and locations, particularly in political contexts
New Auto-Interp
Negative Logits
ccording
-0.64
shock
-0.63
amazon
-0.63
ecd
-0.60
tackle
-0.59
spin
-0.59
agra
-0.59
Leaks
-0.59
duino
-0.59
defic
-0.58
POSITIVE LOGITS
,
1.61
.
1.50
;
1.44
).
1.42
:
1.38
.)
1.37
)
1.36
),
1.35
._
1.32
,"
1.32
Activations Density 0.047%