INDEX
Explanations
the phrase "First of all."
the phrase "first of all" or similar introductory phrases
New Auto-Interp
Negative Logits
obal
-0.73
rams
-0.64
orous
-0.63
ularity
-0.61
weeney
-0.61
ores
-0.61
omas
-0.61
chrom
-0.60
arial
-0.60
ADRA
-0.58
POSITIVE LOGITS
foremost
0.99
kin
0.97
all
0.89
equals
0.83
ali
0.82
COUR
0.72
ortunately
0.71
course
0.69
ELY
0.68
necessity
0.65
Activations Density 0.037%