INDEX
Explanations
short phrases mentioning groups or individuals and their actions or characteristics
conjunctions and expressions of addition or connection in the text
Negative Logits
"],         -0.72
WORK        -0.68
blogspot    -0.63
hart        -0.61
itiz        -0.59
])          -0.59
Times       -0.58
Alert       -0.58
vec         -0.57
iuses       -0.57
Positive Logits
indeed          1.40
consequently    1.33
therefore       1.32
hence           1.25
furthermore     1.25
preferably      1.23
especially      1.20
moreover        1.18
thus            1.14
possibly        1.13
Activations Density: 0.180%