INDEX
Explanations
blocks of code or structured data formats
New Auto-Interp
Negative Logits
Knife
-0.16
egan
-0.16
adx
-0.16
Knife
-0.15
567
-0.15
-0.15
agan
-0.15
-0.14
happiness
-0.14
Kag
-0.14
POSITIVE LOGITS
0.42
0.30
0.27
0.27
19
0.23
0.23
0.22
****************************
0.22
0.21
0.21
Activations Density 0.018%