INDEX
Explanations
phrases indicating the origin or source of content
instances of the word "Originally."
New Auto-Interp
Negative Logits
failure
-0.70
statistical
-0.66
drive
-0.65
rout
-0.63
fault
-0.62
computer
-0.62
impl
-0.61
pressures
-0.61
failures
-0.61
repeat
-0.61
POSITIVE LOGITS
Originally
3.76
Originally
3.25
Initially
1.49
Initially
1.47
Previously
1.44
originally
1.39
Previously
1.35
Original
1.22
orig
1.17
Currently
1.06
Activations Density 0.016%