INDEX
Explanations
references to personal experiences and emotions related to safety and support
New Auto-Interp
Negative Logits
éĤ£æł·
-0.16
adil
-0.14
éĤ£
-0.14
öyle
-0.14
éĤ£
-0.13
ãģ¨ãģĵãĤį
-0.13
those
-0.13
arters
-0.13
FindObjectOfType
-0.13
ÑĤомÑĥ
-0.13
POSITIVE LOGITS
this
0.82
this
0.71
(this
0.61
=this
0.60
nÃły
0.60
,this
0.59
this
0.59
questa
0.57
THIS
0.56
[this
0.56
Activations Density 0.998%