INDEX
Explanations
references to tables in the text
table references
New Auto-Interp
Negative Logits
BDS
-0.43
peda
-0.41
Aussie
-0.41
koala
-0.41
insiders
-0.40
himo
-0.40
ODS
-0.39
demik
-0.39
GHS
-0.39
RSI
-0.39
POSITIVE LOGITS
Table
3.16
Table
2.61
TABLE
2.09
table
2.08
Tables
2.02
tables
1.65
TABLE
1.63
Tables
1.62
getTable
1.51
TABLES
1.49
Activations Density 0.016%