INDEX
Explanations
phrases related to legal issues and controversial events involving potential consequences
instances of punctuation, specifically commas
Negative Logits
atorium    -0.69
ahu        -0.66
wyn        -0.62
ICA        -0.61
iam        -0.59
shore      -0.56
chin       -0.54
ffen       -0.53
USS        -0.52
rar        -0.52
Positive Logits
namely         1.31
albeit         1.20
whereas        1.09
despite        1.06
viz            1.04
irrespective   1.00
thereby        0.99
regardless     0.96
albeit         0.93
rather         0.91
Activations Density 0.407%
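
The density figure can be read as the share of sampled tokens on which the feature fires. The sketch below assumes that definition (a token counts as active when its activation is above zero); the function name, the threshold, and the sample counts are hypothetical and chosen only to reproduce a value of the same size, not taken from the dashboard's actual data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means active tokens divided by all sampled tokens.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 407 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 407 + [0.0] * 99_593
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: it is silent on the large majority of tokens and fires only on a narrow set of contexts.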