Explanations
conversational markers or references to speech
Tokens after a period
overall/analysts
Negative Logits
tvguidetime     -0.74
WebServlet      -0.72
someone         -0.61
ipedia          -0.57
anyone          -0.57
iania           -0.57
everyone        -0.56
somebody        -0.56
extAlignment    -0.55
inguém          -0.54
Positive Logits
overall         1.00
Overall         0.98
Overall         0.94
overall         0.87
Analysts        0.85
总体            0.82
analysts        0.81
OVERALL         0.80
Meanwhile       0.77
Analysts        0.76
Activations Density: 0.143%