INDEX
Explanations
sentences containing a period
sentences or phrases that indicate actions and events
Negative Logits
ordinary      -0.85
policymakers  -0.77
displacement  -0.77
diseng        -0.76
suppressed    -0.74
patrol        -0.74
employment    -0.74
ordinary      -0.73
depreciation  -0.72
temperament   -0.72
Positive Logits
Anyway      1.35
Needless    1.32
Seriously   1.21
Seems       1.20
Basically   1.20
Oops        1.19
Funny       1.18
Hopefully   1.18
Featuring   1.17
Apparently  1.17
Activations Density: 0.599%