INDEX
Explanations
words related to protection and defense
references to protective barriers or defensive mechanisms
New Auto-Interp
Negative Logits
gres
-0.81
iasco
-0.76
pheus
-0.71
ĸļ
-0.69
————————
-0.68
ribune
-0.68
cause
-0.68
lihood
-0.67
Helpful
-0.66
orie
-0.65
POSITIVE LOGITS
heed
0.83
shield
0.79
piercing
0.78
shields
0.78
lain
0.77
absorbing
0.77
buster
0.74
shielded
0.73
shielding
0.71
erected
0.71
Activations Density 0.064%