INDEX
Explanations
instances where something is observed or experienced directly
references to firsthand experiences and eyewitness accounts
New Auto-Interp
Negative Logits
nan
-0.79
merce
-0.78
corn
-0.77
ramid
-0.77
nam
-0.71
rar
-0.71
rug
-0.69
uyomi
-0.67
erenn
-0.67
ishops
-0.66
POSITIVE LOGITS
firsthand
1.16
ewitness
0.94
eyewitness
0.81
Effects
0.73
attest
0.69
testimonies
0.68
VIDEOS
0.66
Explorer
0.64
witness
0.62
perspect
0.62
Activations Density 0.009%