INDEX
    Explanations

    key concepts related to evidence and its credibility

    preceding "is" or "are"

    concept nouns followed by 'is'

    New Auto-Interp
    Negative Logits
    도록
    -0.55
    算了
    -0.52
     rues
    -0.52
     Has
    -0.51
     yet
    -0.49
     match
    -0.49
    まるで
    -0.48
     coincidence
    -0.48
    かったので
    -0.47
    commons
    -0.47
    POSITIVE LOGITS
     typically
    0.79
    と聞
    0.79
     needn
    0.78
     can
    0.75
    seamnă
    0.75
     generally
    0.74
     usually
    0.72
     ≠
    0.72
     kän
    0.71
    ValueStyle
    0.71
    Act Density 0.772%

    No Known Activations