INDEX
Explanations
references to specific entities or organizations mentioned in a context of news or events
instances of the word "the"
New Auto-Interp
Negative Logits
thood
-0.78
RIC
-0.77
lement
-0.73
beard
-0.73
bear
-0.71
aurus
-0.69
Iterator
-0.68
lessly
-0.68
upon
-0.67
ties
-0.66
POSITIVE LOGITS
periphery
1.22
basis
1.19
occasion
1.17
heels
1.16
same
1.14
sidelines
1.14
brink
1.10
verge
1.06
forefront
1.04
premise
1.04
Activations Density 0.223%