INDEX
Explanations
mentions of actions affecting abilities or actions with significant consequences on a larger scale
phrases related to the ability or capacity of entities or individuals to act or operate
New Auto-Interp
Negative Logits
wcsstore
-0.82
lich
-0.72
Aware
-0.67
Streamer
-0.66
Instruct
-0.64
Redditor
-0.62
Forums
-0.61
abe
-0.60
CHAT
-0.60
uggest
-0.59
POSITIVE LOGITS
altogether
1.04
livelihood
0.87
limb
0.84
entire
0.83
inhib
0.82
prematurely
0.80
cherished
0.78
essential
0.76
entirely
0.76
unnecessarily
0.74
Activations Density 0.899%