INDEX
Explanations
enumerated lists and items in a structured format
Negative Logits
Token            Logit
gne              -0.60
agna             -0.58
onData           -0.57
CloseOperation   -0.57
Vigo             -0.56
whit             -0.55
sab              -0.54
Palest           -0.53
gim              -0.52
SHA              -0.52
Positive Logits
Token            Logit
ii               2.86
iii              2.20
ii               2.00
iiiii            1.71
iiii             1.66
iii              1.53
vii              1.32
ⅱ                1.26
iiiiiiii         1.23
sii              1.22
Activations Density: 0.096%