INDEX
Explanations
proper nouns such as names of companies or organizations
acronyms, abbreviations, or initialisms related to various organizations or topics
New Auto-Interp
Negative Logits
enegger
-0.93
disputed
-0.83
bottleneck
-0.83
stroke
-0.77
dividing
-0.75
recess
-0.75
confined
-0.75
charge
-0.74
length
-0.73
substance
-0.71
POSITIVE LOGITS
Fest
1.03
GN
0.99
Forge
0.97
FM
0.96
Week
0.95
GF
0.95
WN
0.93
BW
0.92
TV
0.92
OTA
0.91
Activations Density 0.396%