Explanations
words and phrases related to ongoing conflict or disagreement
punctuation marks indicating questions or dialogue
Quotation punctuation
Negative Logits
GroupLayout       -0.64
tartalomajánló    -0.58
createState       -0.57
artiges           -0.52
UDAD              -0.50
WarningLevel      -0.50
IFICA             -0.49
Skocz             -0.48
ANDUM             -0.47
tagens            -0.47
Positive Logits
?”      1.06
.”      0.95
),”     0.93
.”)     0.92
?”      0.90
,”      0.90
?       0.87
.”—     0.85
,’”     0.84
?’      0.83
Activations Density 21.411%