INDEX
Explanations
opinions or evaluations from a specific standpoint
phrases that indicate various viewpoints or perspectives
New Auto-Interp
Negative Logits
olesc
-0.80
osponsors
-0.78
RANT
-0.74
arus
-0.69
woods
-0.66
strom
-0.66
usters
-0.62
pload
-0.62
kered
-0.62
TIME
-0.62
POSITIVE LOGITS
standpoint
0.94
alone
0.84
onwards
0.74
anyway
0.73
perspective
0.72
:
0.71
,
0.71
alike
0.70
aside
0.69
mma
0.67
Activations Density 0.101%