INDEX
Explanations
URL-like patterns within the text
comments or annotations related to code or scripts
New Auto-Interp
Negative Logits
ĺħ
-0.98
phrine
-0.83
ylum
-0.78
livest
-0.77
subsequ
-0.75
xual
-0.74
confir
-0.72
strut
-0.71
tremend
-0.71
tsunami
-0.70
POSITIVE LOGITS
////////////////
1.37
////////////////////////////////
1.28
////////
1.25
++++++++++++++++
1.06
////
1.01
âĢ¢âĢ¢âĢ¢âĢ¢
0.90
================================================================
0.85
wcsstore
0.84
ãĤ§
0.83
================================
0.79
Activations Density 0.014%