INDEX
Explanations
names of places or organizations
capital letters and proper nouns
New Auto-Interp
Negative Logits
Redditor
-0.71
Reviewer
-0.66
Gamma
-0.61
orters
-0.58
fusc
-0.58
Britann
-0.57
Zero
-0.57
omics
-0.54
Shape
-0.54
acci
-0.54
POSITIVE LOGITS
VILLE
1.26
CITY
1.13
WASHINGTON
1.10
LAND
1.09
FIELD
1.08
TIT
1.06
BUS
1.05
MAN
1.05
HOU
1.04
MEN
1.02
Activations Density 0.142%