INDEX
Explanations
statements conveying observations or perceptions
New Auto-Interp
Negative Logits
pmwiki
-0.75
ItemTracker
-0.71
Contents
-0.68
Kin
-0.67
heres
-0.64
urdue
-0.64
ugal
-0.61
zinski
-0.60
Main
-0.59
BLE
-0.59
POSITIVE LOGITS
tale
0.99
ingly
0.76
apart
0.69
ĵĺ
0.67
eth
0.65
eret
0.64
uyomi
0.63
instinctively
0.63
beforehand
0.63
anecd
0.63
Activations Density 0.034%