INDEX
Explanations
punctuation marks and symbols typically used in casual or informal communication
New Auto-Interp
Negative Logits
declass
-0.70
ricular
-0.62
enza
-0.62
pora
-0.61
ortment
-0.59
AFTA
-0.58
netflix
-0.57
ISBN
-0.56
ulously
-0.56
simul
-0.55
POSITIVE LOGITS
¯¯¯¯
0.96
¯¯¯¯¯¯¯¯
0.80
/)
0.78
')
0.76
////
0.72
\\\\
0.72
¯¯
0.71
slow
0.70
¯
0.70
fe
0.69
Activations Density 0.058%