Explanations
words related to accidents, disasters, and general negative events
coordinating conjunctions and phrases signifying processes or actions
Negative Logits
abs       -0.78
scribe    -0.72
rack      -0.70
cho       -0.68
ivot      -0.67
dding     -0.66
apest     -0.66
Think     -0.66
ãĥĥãĤ¯    -0.66
irting    -0.65
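The token ãĥĥãĤ¯ in the negative logits list looks like the printable byte-level form that GPT-2-style BPE tokenizers use for non-ASCII text. The page does not say which tokenizer produced it, so the following is only a sketch under that assumption; `bytes_to_unicode` re-creates the commonly published GPT-2 byte table, and `decode_token` is a hypothetical helper name.

```python
def bytes_to_unicode():
    """Byte -> printable-character table used by GPT-2-style byte-level BPE.

    Assumption: the dashboard's tokenizer follows this scheme.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are remapped to code points from 256 upward.
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return dict(zip(byte_values, map(chr, code_points)))


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Map each displayed character back to its byte, then decode as UTF-8."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


decoded = decode_token("ãĥĥãĤ¯")
print(decoded)
```

If the assumption holds, the token corresponds to the katakana sequence ック; plain ASCII tokens such as "abs" pass through unchanged.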
Positive Logits
resulted      1.47
lasted        1.47
ended         1.39
remained      1.36
became        1.32
culminated    1.31
stayed        1.23
plummeted     1.22
was           1.21
proceeded     1.21
Activations Density 0.265%
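The page does not define how the density figure is computed. A common reading is the fraction of sampled tokens on which the feature fires at all; the sketch below assumes that definition. The function name `activation_density`, the `threshold` parameter, and the toy activation values are illustrative and not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a non-zero
    activation; the dashboard may use a different definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data (not from the dashboard): 3 active tokens out of 1000.
toy_activations = [0.0] * 997 + [1.2, 0.4, 2.7]
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 0.265% would mean the feature is active on roughly 1 in every 377 sampled tokens.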