INDEX
Explanations
expressions related to personal opinions or perspectives
expressions of personal feelings and opinions
New Auto-Interp
Negative Logits
emale
-0.68
ornia
-0.63
itled
-0.61
atel
-0.58
*)
-0.58
SHARES
-0.57
utterstock
-0.57
Previous
-0.56
penned
-0.55
thouse
-0.55
POSITIVE LOGITS
.'"
1.09
.''
1.02
)."
1.00
."[
1.00
."
0.98
]."
0.96
â̦"
0.90
'."
0.87
mathemat
0.85
.''.
0.81
Activations Density 0.838%