INDEX
Explanations
tokens indicating a new sentence or the start of a new text segment
the indefinite article "a" in various contexts
New Auto-Interp
Negative Logits
assisted
-0.89
breakers
-0.77
anism
-0.75
enance
-0.74
OWS
-0.74
eyes
-0.73
tags
-0.72
friends
-0.71
rates
-0.70
ares
-0.69
POSITIVE LOGITS
lot
1.31
possibility
1.21
shortage
1.21
tendency
1.19
plethora
1.18
definite
1.06
discrepancy
1.02
tremendous
1.02
LOT
1.01
chance
1.01
Activations Density 0.093%