INDEX
Explanations
references to leakage or leaks in various contexts
New Auto-Interp
Negative Logits
mtext
-0.67
Bronson
-0.63
DCC
-0.61
terang
-0.60
mathbf
-0.59
Tro
-0.57
Cartwright
-0.56
Desc
-0.56
reckoning
-0.56
+#+
-0.54
POSITIVE LOGITS
leaks
1.35
leak
1.30
Leak
1.30
leakage
1.27
Leaks
1.23
Leakage
1.19
leak
1.18
Leak
1.10
Leaks
1.10
leaking
1.05
Activations Density 0.006%