INDEX
Explanations
mentions of specific events or locations
Negative Logits
AAC         -0.71
Ort         -0.70
casting     -0.69
elevation   -0.68
Petro       -0.68
Handbook    -0.68
Britann     -0.68
cutting     -0.67
capacity    -0.66
indirect    -0.66
POSITIVE LOGITS
enough      1.33
shit        1.23
too         1.22
matter      1.17
your        1.17
anything    1.17
needed      1.16
distance    1.14
same        1.14
else        1.14
Activations Density 0.072%