Explanations
personal opinions and perspectives
expressions of personal opinion or perspective
Negative Logits
raviolet        -0.72
lycer           -0.68
flush           -0.68
roup            -0.65
tin             -0.63
rium            -0.62
batch           -0.61
perty           -0.59
Combine         -0.59
wait            -0.59
Positive Logits
personally      1.06
adows           0.92
adow            0.92
OGR             0.82
Interstitial    0.80
cca             0.78
©¶æ¥µ           0.72
xc              0.69
asured          0.68
myself          0.67
Activations Density 0.044%