INDEX
Explanations
references to personal experiences and connections with nature
Negative Logits
Token   Logit
Za      -0.15
sme     -0.15
rav     -0.15
avax    -0.15
Ñĥл     -0.15
í       -0.14
麻      -0.14
SRC     -0.13
zilla   -0.13
Bij     -0.13
Positive Logits
Token   Logit
Fi      0.32
Fi      0.27
fi      0.26
DOC     0.26
Mil     0.26
Doub    0.25
Mil     0.23
Milf    0.22
-Fi     0.22
MILF    0.20
Activations Density 0.006%
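
The page does not define the density figure. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero; the function name `activation_density`, the zero threshold, and the sample values are hypothetical and are not taken from the dashboard:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled positions where the
    feature fires at all; the source page does not state its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 6 of which are active.
sample = [0.0] * 99_994 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.006% would mean the feature fires on roughly 6 of every 100,000 sampled tokens.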