    Explanations

    references to temperature

    Negative Logits
    Token            Logit
    " kasarigan"     -1.09
    " فريبيس"        -0.94
    " reaſon"        -0.90
    " itſelf"        -0.90
    " uſe"           -0.89
    " themſelves"    -0.87
    " Theſe"         -0.87
    " purpoſe"       -0.86
    " uſed"          -0.86
    " pleaſure"      -0.86
    Positive Logits
    Token               Logit
    "y"                 0.81
    "e"                 0.77
    " Charlie"          0.73
    "Charlie"           0.67
    " Lang"             0.66
    " &"                0.65
    "um"                0.65
    " McIntyre"         0.63
    "AssignableFrom"    0.63
    "tem"               0.62
    Activation Density: 0.017%

    No Known Activations