INDEX
Explanations
words related to organization names or entities
abbreviations and acronyms
New Auto-Interp
Negative Logits
Liberties
-0.70
Herm
-0.64
Vag
-0.63
Virus
-0.63
nausea
-0.62
=-=-=-=-
-0.61
Johns
-0.60
Lara
-0.60
pretext
-0.60
Citizenship
-0.59
POSITIVE LOGITS
ITION
1.36
ITS
1.32
ISH
1.32
FORM
1.27
WORK
1.27
ANT
1.26
IUM
1.25
NER
1.24
REC
1.22
LET
1.21
Activations Density 0.186%