INDEX
Explanations
mentions of actions, statements, or decisions made by a specific person
references to a specific speaker or subject's statements
Negative Logits
(token not recovered)   -0.61
★★                      -0.61
Starg                   -0.59
[|                      -0.59
guiActiveUnfocused      -0.58
…"                      -0.56
..."                    -0.55
Chaser                  -0.55
~~~~                    -0.54
___                     -0.53
Positive Logits
zbollah   1.39
pherd     1.32
resy      1.19
ldon      1.12
ffield    1.11
odore     1.09
ppard     1.06
miah      1.04
idi       1.00
itage     0.99
Activations Density: 0.160%