INDEX
Explanations
phrases related to staying away or being kept out of a place or situation
phrases indicating avoidance or exclusion from certain situations or places
New Auto-Interp
Negative Logits
elf
-0.82
olars
-0.79
Ĥİ
-0.75
¾
-0.74
testament
-0.74
¨
-0.71
imen
-0.69
ilib
-0.68
¥
-0.67
antioxid
-0.65
POSITIVE LOGITS
agne
0.71
ned
0.69
altogether
0.68
odox
0.68
bother
0.66
sites
0.66
Ïī
0.65
Trace
0.64
ulnerable
0.63
Scy
0.62
Activations Density 0.050%