INDEX
Explanations
pronouns with verbs indicating action towards an object
references to an object or concept being acted upon or discussed repeatedly
New Auto-Interp
Negative Logits
ãĥ©ãĥ³
-0.65
idth
-0.62
Frontier
-0.62
United
-0.58
Governments
-0.58
Mans
-0.57
mission
-0.57
Telecommunications
-0.56
Foreign
-0.56
Geneva
-0.54
POSITIVE LOGITS
alian
1.37
self
1.21
unes
1.21
chy
1.01
iner
0.90
asca
0.88
ELF
0.84
geist
0.83
ueller
0.80
atic
0.79
Activations Density 0.168%