INDEX
Explanations
quotations with statements of personal opinion
statements or claims that emphasize the significance or intensity of a situation or event
Negative Logits
Token          Logit
?).            -0.79
.).            -0.68
).             -0.63
+.             -0.63
().            -0.62
odox           -0.62
.*             -0.60
respectively   -0.59
cum            -0.58
dq             -0.58
Positive Logits
Token          Logit
%"             1.15
[              1.11
,"             1.01
[/             0.89
.,"            0.87
":             0.82
"]             0.81
,'"            0.79
…"             0.78
!"             0.77
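The two lists above are the ten lowest and ten highest token logit values for this feature. As a minimal sketch of how such lists could be selected, assuming the per-token values are already available as a mapping from token string to number, the ranking step might look like the following. The function name `top_and_bottom_logits` and the `effects` sample are hypothetical and only reuse a few values from the lists; this is not the dashboard's own code.

```python
def top_and_bottom_logits(logit_effects, k=10):
    """Return (negative, positive): the k lowest and k highest (token, value) pairs.

    `logit_effects` is assumed to map each token string to its logit value.
    """
    # Sort ascending by value so the most negative entries come first.
    ranked = sorted(logit_effects.items(), key=lambda item: item[1])
    negative = ranked[:k]          # most negative first
    positive = ranked[::-1][:k]    # most positive first
    return negative, positive


# A small sample taken from the lists above (not the full vocabulary).
effects = {
    '%"': 1.15,
    '[': 1.11,
    ',"': 1.01,
    '?).': -0.79,
    '.).': -0.68,
    ').': -0.63,
}

neg, pos = top_and_bottom_logits(effects, k=2)
```

With the sample above, `neg` holds the two most negative tokens and `pos` the two most positive, in the same order the lists use.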
Activations Density 1.038%
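The page does not define the density figure. A common reading, taken here as an assumption, is the fraction of sampled tokens on which the feature's activation is nonzero. Under that assumption, a minimal sketch of the calculation is shown below; `activation_density`, its `threshold` parameter, and the sample values are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumes `activations` holds one non-negative value per sampled token.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# 100 made-up token activations, 3 of which are nonzero.
sample = [0.0] * 96 + [0.4, 1.2, 0.0, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density near 1%, as reported above, would then mean the feature is active on roughly one token in a hundred.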