INDEX
Explanations
words related to organizations or institutions
abbreviations or acronyms related to organizations, reports, and scientific entities
New Auto-Interp
Negative Logits
ado
-0.65
gie
-0.62
Gors
-0.61
Elixir
-0.61
Breitbart
-0.61
puff
-0.60
Esp
-0.59
Bohem
-0.58
Fn
-0.58
compan
-0.58
POSITIVE LOGITS
KI
1.04
WD
0.96
ESS
0.94
FORM
0.93
KT
0.92
ET
0.90
ITE
0.90
RAG
0.89
amily
0.89
HE
0.89
Activations Density 0.186%