INDEX
Explanations
words related to urban environments
terms related to urban environments and infrastructure
New Auto-Interp
Negative Logits
creen
-0.84
aurus
-0.79
xon
-0.73
CHR
-0.70
lder
-0.69
ippi
-0.68
prus
-0.68
zona
-0.67
ertodd
-0.66
xit
-0.66
POSITIVE LOGITS
urb
1.00
ulent
0.92
inators
0.91
abies
0.91
ulence
0.91
inator
0.86
inson
0.84
inating
0.75
idge
0.75
ruary
0.75
Activations Density 0.022%