INDEX
Explanations
proper nouns such as names of people, organizations, and locations
references to specific organizations and institutions
Negative Logits
Token         Logit
=-=-=-=-      -0.70
bryce         -0.62
tomorrow      -0.60
beforehand    -0.60
ASAP          -0.60
recent        -0.59
:,            -0.59
particulars   -0.59
━             -0.58
complying     -0.57
Positive Logits
Token         Logit
began         1.14
became        1.14
underwent     1.04
reverted      1.03
wrote         1.02
debuted       1.01
finally       1.00
abruptly      1.00
embarked      0.99
undertook     0.98
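The page does not state how the two logit lists are produced. As a rough sketch only, assuming a mapping from token to logit effect is already available (the `effects` dictionary and the `top_logits` helper are hypothetical names, and the sample values are copied from the lists above), the most negative and most positive entries could be selected like this:

```python
import heapq

def top_logits(effects, k):
    """Return the k most negative and k most positive (token, logit) pairs."""
    items = effects.items()
    negative = heapq.nsmallest(k, items, key=lambda kv: kv[1])
    positive = heapq.nlargest(k, items, key=lambda kv: kv[1])
    return negative, positive

# Sample values taken from the lists above; the real mapping would cover
# the whole vocabulary.
effects = {
    "began": 1.14,
    "underwent": 1.04,
    "wrote": 1.02,
    "=-=-=-=-": -0.70,
    "bryce": -0.62,
    "tomorrow": -0.60,
}

negative, positive = top_logits(effects, k=2)
for token, value in negative + positive:
    print(f"{token:<12} {value:+.2f}")
```

The same helper with `k=10` would return lists of the length shown on this page.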
Activations Density 0.482%
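For orientation, an activation density figure like the one above is commonly read as the fraction of token positions on which the feature is active. The sketch below assumes that reading and a firing threshold of zero; neither is confirmed by this page, and `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)

# Hypothetical data: 1,000 token positions, feature active on 5 of them.
sample = [0.0] * 995 + [0.3, 1.2, 0.8, 2.1, 0.05]

density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.500%
```

Under this reading, 0.482% would mean the feature is active on roughly 5 of every 1,000 tokens.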