INDEX
Explanations
proper nouns related to personal names and places
words related to the notion of nature and environmental contexts
New Auto-Interp
Negative Logits
inates
-0.76
ittal
-0.72
unity
-0.67
inances
-0.67
inately
-0.66
ublic
-0.65
inated
-0.64
inate
-0.63
heses
-0.63
inating
-0.62
POSITIVE LOGITS
tes
0.77
lis
0.76
bye
0.71
jriwal
0.70
tarian
0.69
leaf
0.68
eers
0.67
rencies
0.66
olesc
0.65
atoes
0.65
Activations Density 0.069%