INDEX
Explanations
references to various types of pain and discomfort
New Auto-Interp
Negative Logits
+#+#
-0.60
########.
-0.60
GTCX
-0.59
iprot
-0.55
AssemblyCompany
-0.53
SharedCtor
-0.52
tagPool
-0.52
monkey
-0.50
witch
-0.49
galus
-0.49
POSITIVE LOGITS
pain
0.52
romyalgia
0.50
Pain
0.48
douleur
0.48
merzen
0.46
Pain
0.46
dores
0.45
dolor
0.44
douleurs
0.41
Schmerzen
0.41
Activations Density 0.023%