INDEX
Explanations
numerical values
specific numeric values or quantifications
Negative Logits
sonian      -0.69
Osw         -0.68
dinand      -0.67
enegger     -0.67
cout        -0.62
tidal       -0.60
dumps       -0.60
cryptoc     -0.60
shire       -0.59
landmark    -0.59
Positive Logits
cia         0.95
Flavoring   0.88
oire        0.73
aria        0.71
render      0.70
oft         0.70
yet         0.70
Minecraft   0.69
Filter      0.69
ide         0.68
Activations Density 0.000%