INDEX
Explanations
proper nouns or names of people, places, or things, particularly associated with awards, achievements, or governmental entities
phrases related to official awards or recognitions
Negative Logits
boats       -0.76
Magikarp    -0.73
asons       -0.72
igans       -0.72
Jews        -0.71
Students    -0.71
arms        -0.70
Maps        -0.70
uits        -0.69
Links       -0.69
Positive Logits
latter            1.05
dreaded           1.05
entire            1.02
venerable         0.97
same              0.95
original          0.94
coveted           0.94
aforementioned    0.92
entirety          0.86
infamous          0.84
Activations Density 0.641%