INDEX
Explanations
entities or terms related to a specific organization or group
proper nouns, particularly names and places
New Auto-Interp
Negative Logits
OTA
-0.85
eem
-0.76
CAST
-0.74
Mustang
-0.71
Blackburn
-0.69
bomber
-0.67
rets
-0.66
Sharks
-0.66
creen
-0.65
Canary
-0.64
POSITIVE LOGITS
iano
0.82
oulos
0.81
ilogy
0.80
ipel
0.80
DragonMagazine
0.78
ilib
0.77
iscopal
0.76
ographs
0.72
iers
0.72
tuber
0.70
Activations Density 0.022%