INDEX
Explanations
scientific and pseudo-scientific terms
terms related to scientific disciplines and methodologies
Negative Logits
ILA       -0.77
ecause    -0.73
NCT       -0.71
skirts    -0.71
DRAG      -0.68
mington   -0.68
ABE       -0.66
Emblem    -0.65
drops     -0.63
boats     -0.63
Positive Logits
ific      1.44
cient     1.05
ral       0.89
itude     0.85
ificate   0.84
ocratic   0.81
ology     0.81
rily      0.81
ificent   0.80
arily     0.78
Activations Density 0.023%