INDEX
Explanations
terms related to introductory sections or written pieces
instances of the word "introduction."
New Auto-Interp
Negative Logits
esthetic
-0.72
gem
-0.70
robe
-0.70
mic
-0.70
yss
-0.70
rone
-0.68
roph
-0.68
rage
-0.67
hap
-0.67
ramid
-0.64
POSITIVE LOGITS
Takeru
0.78
ablishment
0.77
ounter
0.76
introduction
0.75
urated
0.74
introdu
0.73
prise
0.73
APR
0.72
lishes
0.72
introducing
0.70
Activations Density 0.012%