INDEX
Explanations
proper nouns
occurrences of the substring "gin"
New Auto-Interp
Negative Logits
Attempts
-0.68
elapsed
-0.64
punishment
-0.64
HHS
-0.63
unintended
-0.62
supervisors
-0.62
anyahu
-0.61
plaintiff
-0.61
isites
-0.60
District
-0.59
POSITIVE LOGITS
gin
1.48
uine
1.06
forth
1.03
atin
0.94
uin
0.94
fo
0.92
Ò
0.90
etta
0.90
herer
0.87
atus
0.86
Activations Density 0.007%