Explanations
instances where the text suggests the presence of a specific issue or matter needing attention or resolution
statements indicating the existence or occurrence of something
Negative Logits
chains         -0.69
iaries         -0.68
predecessors   -0.64
segments       -0.63
ilst           -0.62
verse          -0.61
nels           -0.61
receipt        -0.60
heels          -0.60
earners        -0.59
Positive Logits
happening      1.03
transpired     0.77
forgiven       0.74
missing        0.73
wrong          0.72
gonna          0.71
done           0.71
bothering      0.70
going          0.69
cov            0.69
Activations Density: 0.119%