INDEX
Explanations
phrases related to scientific research and experimentation
New Auto-Interp
Negative Logits
DX
-0.56
cia
-0.54
agon
-0.53
incent
-0.52
atten
-0.51
rises
-0.51
Ĥİ
-0.51
itans
-0.50
bending
-0.50
ibble
-0.49
POSITIVE LOGITS
thereby
0.74
consequently
0.73
thence
0.73
then
0.67
thus
0.66
vice
0.65
therefore
0.64
possibly
0.63
preferably
0.62
hence
0.61
Activations Density 18.660%