INDEX
Explanations
mentions of the word "ob" with varying degrees of activation
occurrences of the word "ob" in varying contexts
Negative Logits
ivities             -0.77
³³³³³³³³            -0.70
Quan                -0.69
cort                -0.67
imates              -0.66
³³³³                -0.64
Wr                  -0.61
ORIG                -0.60
++++++++++++++++    -0.59
Actual              -0.59
Positive Logits
acter               1.39
ranch               1.24
server              1.20
edience             1.20
esity               1.19
urden               1.19
acterial            1.18
acteria             1.17
last                1.13
livion              1.10
Activations Density: 0.031%