INDEX
Explanations
phrases related to the concept of 'living' or existence
Negative Logits
pling      -0.17
.nz        -0.16
utsch      -0.15
eut        -0.15
wart       -0.15
aly        -0.14
ucid       -0.14
ovies      -0.14
list       -0.14
ificial    -0.13
Positive Logits
ardo            0.18
boat            0.15
blood           0.15
/work           0.15
-threatening    0.15
/shop           0.15
urm             0.14
illisecond      0.14
trữ             0.14
.opensource     0.14
Activations Density: 0.052%