INDEX
Explanations
proper nouns and locations
references to a specific city or location
New Auto-Interp
Negative Logits
thood
-0.87
20439
-0.84
ccoli
-0.84
γ
-0.81
IQ
-0.80
istries
-0.75
ophen
-0.74
olitics
-0.73
ãĥīãĥ©
-0.73
witch
-0.72
POSITIVE LOGITS
tallest
1.42
adjoining
1.21
roof
1.20
roofs
1.18
entire
1.17
walls
1.15
largest
1.15
interior
1.12
exterior
1.10
oldest
1.10
Activations Density 0.504%