INDEX
Explanations
mentions of the human body
mentions of the term 'body.'
New Auto-Interp
Negative Logits
Kafka
-0.77
Clover
-0.66
Nex
-0.65
Dickens
-0.65
BILITY
-0.64
Hoover
-0.64
Ans
-0.64
Monthly
-0.62
Liberty
-0.61
PBS
-0.61
POSITIVE LOGITS
guards
1.13
anguage
1.04
builder
0.99
builders
0.97
body
0.97
fat
0.92
weight
0.90
body
0.89
guard
0.89
parts
0.86
Activations Density 0.021%