INDEX
Explanations
the word "think" or related terms
instances of the word "think" and its variations
New Auto-Interp
Negative Logits
çĦ
-0.68
iations
-0.63
Videos
-0.61
flats
-0.59
Naz
-0.57
iona
-0.56
Merit
-0.56
feeding
-0.55
promoters
-0.55
variance
-0.55
POSITIVE LOGITS
aloud
0.94
about
0.83
xiety
0.81
ABOUT
0.81
cient
0.79
onym
0.78
ileaks
0.78
eteen
0.78
fully
0.74
onymous
0.74
Activations Density 0.063%