INDEX
Explanations
locations or places mentioned in the text
the word "in" across various contexts
New Auto-Interp
Negative Logits
Upload
-0.64
giving
-0.60
critics
-0.59
nell
-0.59
76561
-0.57
enaries
-0.55
linger
-0.54
chops
-0.54
detractors
-0.54
¿½
-0.54
POSITIVE LOGITS
sight
0.98
existence
0.94
between
0.93
society
0.87
animate
0.83
planet
0.81
life
0.81
heaven
0.79
between
0.78
history
0.77
Activations Density 0.116%