INDEX
Explanations
phrases concerning the legal use and distribution of material
phrases related to content usage restrictions and copyright
New Auto-Interp
Negative Logits
mens
-0.68
Parables
-0.61
jen
-0.59
achine
-0.59
mons
-0.57
Carnage
-0.56
hers
-0.55
oad
-0.54
occ
-0.53
Hare
-0.53
POSITIVE LOGITS
rewritten
1.27
archived
0.90
redistributed
0.88
reprinted
0.78
written
0.75
transmitted
0.75
reproduced
0.74
published
0.74
photographed
0.74
copied
0.74
Activations Density 0.044%