Explanations
phrases or terms related to computer files and programming languages
occurrences of the substring "pt"
Negative Logits
heads       -0.66
beit        -0.65
Bey         -0.64
HAHA        -0.63
BIP         -0.63
resil       -0.62
HAHAHAHA    -0.62
WAY         -0.62
Bravo       -0.62
CE          -0.61
Positive Logits
itude       1.10
rees        1.03
icket       0.98
ember       0.97
sis         0.97
acular      0.96
itudes      0.90
uple        0.90
ree         0.90
raction     0.89
Activations Density 0.036%