INDEX
Explanations
instances where the document mentions the act of thinking or reflecting
discussions centered around the concept of thinking
New Auto-Interp
Negative Logits
shale
-0.66
catentry
-0.65
bri
-0.62
waters
-0.58
contamin
-0.58
CBC
-0.57
key
-0.57
abolished
-0.57
ague
-0.55
videos
-0.55
POSITIVE LOGITS
orial
0.82
Turing
0.77
-|
0.77
pad
0.76
lift
0.72
ortment
0.70
onymous
0.70
xiety
0.69
aloud
0.68
iste
0.68
Activations Density 0.046%