INDEX
Explanations
youtube and website domains
New Auto-Interp
Negative Logits
script
0.58
Subset
0.57
discrete
0.57
automatic
0.56
automatically
0.55
Chunk
0.55
taking
0.55
Darkness
0.54
Script
0.54
take
0.53
POSITIVE LOGITS
com
1.32
org
1.19
gov
1.04
com
1.04
Org
0.97
COM
0.95
ORG
0.93
org
0.92
britann
0.91
gov
0.89
Activations Density 0.563%