INDEX
Explanations
mentions of specific websites or domains, particularly from Microsoft and Mozilla
New Auto-Interp
Negative Logits
akens
-0.15
URED
-0.14
iva
-0.14
******************************************************************************↵
-0.14
ames
-0.14
.eq
-0.14
dam
-0.14
mini
-0.14
/Linux
-0.13
iec
-0.13
POSITIVE LOGITS
/gif
0.16
513
0.16
Įĵ
0.15
Fav
0.15
fireEvent
0.15
ãģķãĤĵãģĮ
0.15
inding
0.15
à¹Ħà¸Ĺย
0.14
rell
0.14
ãģķãĤĵãģ¯
0.14
Activations Density 0.004%