INDEX
Explanations
elements related to introductions and beginnings of discussions
Negative Logits
ivia    -0.15
åıİ    -0.14
ooter    -0.14
hani    -0.13
اÙĪÛĮ    -0.13
تاب    -0.13
jid    -0.13
Doyle    -0.13
볬    -0.12
avra    -0.12
Positive Logits
introduction    1.15
introdu    1.10
Introduction    1.01
intro    0.93
Introduction    0.93
introduce    0.93
introduced    0.88
introducing    0.85
Intro    0.84
introductory    0.81
Activations Density 0.364%
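
As a rough illustration of how a density figure like the one above could be derived, here is a minimal sketch. It assumes density means the fraction of token positions where the feature's activation is greater than zero. That definition, the function name `activation_density`, and the sample data are all hypothetical and do not come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions on which the
    feature fires at all; the page itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 token positions, 4 of them active.
sample = [0.0] * 996 + [0.7, 1.2, 0.3, 2.1]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.364% would mean the feature fires on roughly 36 of every 10,000 token positions.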