INDEX
Explanations
statements containing the word "According."
New Auto-Interp
Negative Logits
IFT
-0.69
eg
-0.66
ql
-0.65
affe
-0.64
igmat
-0.63
obyl
-0.60
riage
-0.59
oppable
-0.59
estern
-0.59
apons
-0.58
POSITIVE LOGITS
sources
0.80
SOURCE
0.73
Sources
0.70
translation
0.68
ly
0.67
edly
0.66
Ĥİ
0.66
Rank
0.65
tains
0.64
tained
0.63
Activations Density 0.355%