INDEX
Explanations
mentions of actions or activities that involve sharing or consuming items
references to specific entities or expressions within the text
Negative Logits
warr      -0.63
Siber     -0.61
sequest   -0.60
exhib     -0.60
deduct    -0.59
Xperia    -0.56
pak       -0.56
Scand     -0.54
ukong     -0.54
imus      -0.53
Positive Logits
aughs     0.71
$.        0.67
ï¸ı       0.67
maxwell   0.67
outh      0.66
emetery   0.65
alian     0.65
thia      0.65
osta      0.64
udder     0.64
Activations Density 0.346%