INDEX
Explanations
instances of the word "thought"
Negative Logits
Token          Logit
kw             -0.65
iona           -0.63
ç«             -0.62
Redditor       -0.62
Lv             -0.59
osures         -0.59
conservancy    -0.58
substr         -0.58
Contents       -0.58
Shipping       -0.57
Positive Logits
Token          Logit
fully          1.05
lessly         0.93
fulness        0.87
ileaks         0.80
ij士           0.77
ought          0.73
stock          0.72
aloud          0.72
inery          0.70
culus          0.69
Activations Density 0.023%