INDEX
Explanations
proper nouns related to specific locations or people
references to specific locations and entities related to events
Negative Logits
rac       -0.78
opian     -0.76
culus     -0.74
ascus     -0.71
linkage   -0.71
arg       -0.70
saddle    -0.69
dynam     -0.69
packing   -0.69
orbital   -0.69
Positive Logits
To        1.30
Like      1.30
Of        1.28
Where     1.26
Not       1.26
With      1.25
Again     1.25
Twice     1.25
That      1.24
Were      1.24
Activations Density: 0.332%