INDEX
    Explanations

    conclusions or implications based on given information

    the word "therefore" indicating logical conclusions or reasoning

    New Auto-Interp
    Negative Logits
     Defenders
    -0.69
    Fram
    -0.66
     Feld
    -0.64
     Debor
    -0.63
     Yel
    -0.63
     Patty
    -0.62
     Franklin
    -0.62
    MM
    -0.61
     Ott
    -0.60
     Vancouver
    -0.60
    POSITIVE LOGITS
    forth
    1.16
    ettings
    0.84
    uracy
    0.82
     facto
    0.79
    elist
    0.78
     guiActiveUn
    0.77
    ãģ®éŃĶ
    0.76
     necess
    0.76
     theoretically
    0.75
    odynam
    0.74
    Act Density 0.020%

    No Known Activations