INDEX
Explanations
references to non-reflective materials or concepts
the term "non" in various contexts
New Auto-Interp
Negative Logits
Tycoon
-0.96
Oaks
-0.67
Seasons
-0.64
Takes
-0.62
downfall
-0.61
guts
-0.61
Grill
-0.60
IUM
-0.60
trove
-0.59
Learns
-0.59
POSITIVE LOGITS
chal
1.33
linear
1.32
verbal
1.28
stop
1.22
lethal
1.22
fiction
1.21
etheless
1.17
violence
1.17
compliance
1.17
binary
1.15
Activations Density 0.028%