INDEX
    Explanations

    phrases expressing equivalence or comparisons

    comparisons indicating equivalence in various contexts

    New Auto-Interp
    Negative Logits
    stal
    -0.82
    hra
    -0.77
    spe
    -0.76
    stra
    -0.74
    oard
    -0.74
     Bomb
    -0.68
    omen
    -0.67
    bean
    -0.67
     Roads
    -0.65
     Mush
    -0.65
    POSITIVE LOGITS
    ivalent
    0.84
    lihood
    0.83
    isons
    0.82
    imately
    0.80
     amounts
    0.77
    terday
    0.77
    icut
    0.74
    aminer
    0.72
     equivalent
    0.72
    oreal
    0.72
    Act Density 0.016%

    No Known Activations