INDEX
Explanations
phrases indicating quoted speech or reported statements
phrases that indicate the reporting or stating of information
Negative Logits
twitch    -0.68
æĸ    -0.63
ä½ľ    -0.62
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³    -0.62
ãĥĩ    -0.61
spread    -0.60
Talent    -0.60
ivot    -0.59
è¯    -0.57
sv    -0.57
Positive Logits
:"    0.77
"...    0.75
omin    0.67
"â̦    0.66
authors    0.65
"(    0.64
:-    0.61
quoting    0.60
lishes    0.59
:'    0.59
Activations Density 0.199%