INDEX
Explanations
words related to official titles or affiliations
instances of acronyms or abbreviations
New Auto-Interp
Negative Logits
Magikarp
-0.62
Archangel
-0.60
headers
-0.59
Gun
-0.59
frey
-0.58
darts
-0.58
wings
-0.57
SOURCE
-0.57
Pearl
-0.57
blast
-0.56
POSITIVE LOGITS
cific
0.98
pta
0.83
ificant
0.82
uration
0.79
utes
0.77
ceed
0.76
encies
0.75
acist
0.72
ribing
0.72
ccess
0.71
Activations Density 0.061%