Explanations
references to a specific term, "Boilerplate"
references to the term "Bo" and its contextual use
Negative Logits
Interstitial   -0.83
orial          -0.82
代             -0.79
terday         -0.76
rity           -0.75
ional          -0.74
UAL            -0.71
ablishment     -0.70
yrinth         -0.68
mary           -0.67
Positive Logits
Bo             1.22
Bo             1.07
Bagg           1.05
zz             0.97
zeb            0.95
zzi            0.92
zos            0.90
gey            0.86
iler           0.86
olean          0.85
Activations Density 0.009%