INDEX
Explanations
phrases that indicate a specific timeframe in the past
the article "a" in various contexts
New Auto-Interp
Negative Logits
Catal
-0.84
AIDS
-0.83
Edit
-0.83
culosis
-0.72
EU
-0.71
TPP
-0.71
independence
-0.70
ahime
-0.70
achine
-0.69
evidence
-0.69
POSITIVE LOGITS
handful
1.13
bunch
1.08
lot
0.99
fraction
0.99
tad
0.98
few
0.97
couple
0.96
bit
0.90
temporary
0.89
dozen
0.89
Activations Density 0.130%