Explanations
quotations from or about individuals, particularly regarding personal philosophies or reflections
Negative Logits
Token        Logit
synerg       -0.77
knockout     -0.74
regenerate   -0.74
equival      -0.74
distilled    -0.73
endeav       -0.72
undet        -0.72
pse          -0.72
oppressed    -0.71
citiz        -0.71

Positive Logits
Token        Logit
Indeed       1.55
Asked        1.55
Others       1.45
Newsletter   1.44
Another      1.43
Refer        1.42
Despite      1.42
Added        1.41
Although     1.39
However      1.39
Activations Density 0.274%
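
The page does not define the density figure. A common reading, assumed here, is the fraction of sampled token positions on which the feature has a nonzero activation. The sketch below illustrates that reading only. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 27 of which activate the feature.
sample = [0.0] * 9973 + [1.2] * 27
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.274% would mean the feature fires on roughly 27 of every 10,000 sampled tokens.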