Explanations
instances of the word "thinking" followed by a number
instances of the word "thinking."
Negative Logits
çĦ           -0.78
feeding      -0.64
Wrestling    -0.64
CBC          -0.63
Ann          -0.63
any          -0.61
videos       -0.60
clad         -0.60
Videos       -0.59
owship       -0.59
Positive Logits
aloud            0.84
provoking        0.81
sonian           0.80
cient            0.78
about            0.74
eteen            0.72
ortment          0.70
lass             0.70
inery            0.69
strategically    0.68
Activations Density: 0.033%