INDEX
Explanations
phrases containing the word "there"
repetitions of the word "there"
Negative Logits
Token       Logit
cogn        -0.62
favour      -0.62
ME          -0.61
CJ          -0.59
franc       -0.58
tolerant    -0.56
cade        -0.56
Franc       -0.55
uously      -0.55
icial       -0.55
Positive Logits
Token       Logit
abouts      1.31
upon        1.04
ngth        0.90
fore        0.84
geist       0.78
FORE        0.77
after       0.74
ichick      0.73
hovah       0.73
ain         0.69
Activations Density: 0.130%