INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    RegressionTest
    -0.60
    Hentet
    -0.58
     Ooster
    -0.56
     betweenstory
    -0.55
     Filling
    -0.55
     Fill
    -0.53
     يتيمه
    -0.52
     filling
    -0.50
    #
    -0.49
    -0.49
    POSITIVE LOGITS
     dissolve
    0.68
     unc
    0.66
     thin
    0.65
     break
    0.65
     loosen
    0.64
     clear
    0.60
     dis
    0.59
     liqu
    0.59
     Dissolve
    0.59
     melt
    0.57
    Act Density 0.000%

    No Known Activations