INDEX
Explanations
words related to location, particularly where someone is originally from
instances of origin or place of belonging
New Auto-Interp
Negative Logits
idav
-0.67
ercise
-0.67
aminer
-0.65
awatts
-0.63
EPA
-0.61
needed
-0.60
Assistant
-0.59
awaited
-0.59
releases
-0.58
release
-0.58
POSITIVE LOGITS
afar
0.87
abroad
0.83
rural
0.79
nowhere
0.78
humble
0.77
poorer
0.76
angu
0.75
overseas
0.74
affluent
0.74
wealthy
0.72
Activations Density 0.104%