INDEX
Explanations
quotations within text
occurrences of commas in sentences
Negative Logits
erity          -0.62
Settlement     -0.59
Holy           -0.57
seams          -0.57
proof          -0.57
Next           -0.57
trouble        -0.56
Maker          -0.56
DonaldTrump    -0.55
irie           -0.55
Positive Logits
adding         1.13
referring      1.01
citing         1.00
noting         0.98
echoing        0.97
quoting        0.93
recalling      0.91
emphasizing    0.91
stressing      0.87
albeit         0.85
Activations Density 0.123%