INDEX
    Explanations

    phrases related to knowledge or stating facts

    Negative Logits
    " optik"       -0.69
    " kapital"     -0.63
    " kristal"     -0.60
    " silikon"     -0.60
    " adal"        -0.60
    " etik"        -0.58
    " keramik"     -0.58
    " ekster"      -0.56
    " kilomet"     -0.55
    " alkoh"       -0.55
    Positive Logits
    " disreg"      0.89
    " know"        0.81
    " shenan"      0.80
    " unspeak"     0.80
    "know"         0.78
    " KNOW"        0.76
    " knows"       0.72
    " quivering"   0.71
    " impra"       0.70
    " Know"        0.69
    Act Density 0.074%

    No Known Activations