INDEX
Explanations
instances of the word "as" used for comparisons or similes
New Auto-Interp
Negative Logits
çīĪ
-0.65
oller
-0.65
OSE
-0.54
Mandatory
-0.54
acent
-0.54
Mellon
-0.53
lockdown
-0.51
lish
-0.50
CCC
-0.49
Noble
-0.49
POSITIVE LOGITS
regards
0.95
phy
0.94
pects
0.91
evidenced
0.90
par
0.87
opposed
0.85
bestos
0.83
well
0.82
fuck
0.81
phalt
0.81
Activations Density 0.101%