INDEX
Explanations
phrases related to the strength or resilience of something, often with the word "the" preceding it
frequent use of the word "the"
Negative Logits
Token      Logit
vl         -0.83
autions    -0.79
imity      -0.77
icia       -0.77
erity      -0.75
fulness    -0.74
ptions     -0.71
abel       -0.70
worth      -0.70
earance    -0.70
Positive Logits
Token        Logit
respective   0.92
entire       0.88
cosmos       0.88
individual   0.88
profession   0.87
preceding    0.86
sexes        0.85
wearer       0.85
nation       0.85
universe     0.85
Activations Density 0.310%
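
The density figure above can be read as a simple ratio. The sketch below assumes "activation density" means the fraction of token positions on which the feature has a nonzero activation; the page itself does not state that definition. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical, chosen only to show how a value formatted like the one above could arise.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 31 of them active.
toy_activations = [0.0] * 9969 + [1.0] * 31
density = activation_density(toy_activations)

# Format as a percentage with three decimals, matching the page's style.
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.310% would mean the feature fires on roughly 31 of every 10,000 tokens, which is consistent with a sparse feature.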