INDEX
Explanations
references and citations in scientific texts
Negative Logits
lift             -0.35
Gate             -0.34
twimg            -0.33
autorytatywna    -0.33
MID              -0.31
lifts            -0.31
Portale          -0.31
gate             -0.31
texttt           -0.30
CountDown        -0.30
Positive Logits
kasarigan        0.79
otomatig         0.56
دانشنامهٔ        0.55
HasFactory       0.52
orteur           0.52
ſeinen           0.52
icksburg         0.51
ſei              0.50
Geſch            0.50
grés             0.50
Activations Density: 1.725%