
Mempool fix #2741

Merged: 2 commits merged into OP-TEE:master from mempool_fix on Jan 16, 2019

Conversation

@jenswi-linaro (Contributor)

No description provided.

@jforissier (Contributor)

This adds some more complexity :( Why not simply revert d4f909c? Do we have numbers showing the optimization is significant?

@jenswi-linaro (Contributor, author)

Here are my results on Hikey:
github/master (edbb89f):

4006  real      1m 41.11s
4007  real      1m 14.51s
4008  real      0m 0.13s
4009  real      1m 5.68s

Revert "mempool: optimize reference counting":

4006  real      3m 27.78s
4007  real      0m 50.03s
4008  real      0m 0.13s
4009  real      2m 24.07s

Using this script:

for a in 4006 4007 4008 4009 ; do \
echo -n $a " " >> time.txt ;\
time -o time.txt.tmp xtest -l 15 $a || break ;\
grep real time.txt.tmp >> time.txt
done
cat time.txt

@jforissier (Contributor)

Thanks for the numbers. So there is quite a big difference indeed... How does the code in this PR compare?

Assuming the code proposed here is still much faster, that makes me worry about the performance of our mutexes. The original code, with a mutex and a condvar, is a typical pattern... Is there a way we could optimize the mutex common case (locking a mutex that is currently unlocked)? With an atomic_load() maybe?

@jenswi-linaro (Contributor, author)

"mempool: fix race in get_pool()"

4006  real      1m 37.51s
4007  real      0m 56.67s
4008  real      0m 0.09s
4009  real      1m 3.18s

Second run

4006  real      1m 37.61s
4007  real      0m 35.32s
4008  real      0m 0.13s
4009  real      1m 3.15s

So I'd say it's, if anything, slightly faster.

I think we should try to improve the fast path of the mutex. I'm not so sure we'll get numbers that can compete with these, but it will be useful anyway. I guess we could base the fast path on the atomic_cas_ushort() function.
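
A minimal sketch of what such a CAS-based fast path could look like, using a GCC/Clang builtin as a stand-in for atomic_cas_ushort(); the struct, the state encoding and the slow-path hook are hypothetical, not OP-TEE's actual mutex implementation:

#include <stdbool.h>

/* Hypothetical mutex with a one-word state: 0 = unlocked, 1 = locked */
struct sketch_mutex {
        unsigned short state;
};

/* Stand-in for atomic_cas_ushort(): swap *p from oval to nval atomically */
static bool sketch_cas_ushort(unsigned short *p, unsigned short oval,
                              unsigned short nval)
{
        return __atomic_compare_exchange_n(p, &oval, nval, false,
                                           __ATOMIC_ACQUIRE, __ATOMIC_RELAXED);
}

/* Existing contended path (wait queue, scheduling, ...), not shown here */
void sketch_mutex_lock_slow(struct sketch_mutex *m);

static void sketch_mutex_lock(struct sketch_mutex *m)
{
        /* Fast path: an uncontended mutex is taken with a single CAS */
        if (sketch_cas_ushort(&m->state, 0, 1))
                return;

        /* Slow path: someone else holds the mutex */
        sketch_mutex_lock_slow(m);
}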

@jenswi-linaro (Contributor, author)

What we need here is a recursive mutex.
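
For reference, a recursive mutex in this context means roughly the following. This is a hypothetical sketch on top of OP-TEE's plain mutex, with made-up names; a real version would also read the owner field atomically, as this PR does with atomic_load_int():

#include <kernel/mutex.h>
#include <kernel/thread.h>

/* Hypothetical recursive mutex: a plain mutex plus owner and nesting count */
struct sketch_rmutex {
        struct mutex lock;
        int owner;              /* thread id of the current owner, -1 if free */
        unsigned int count;     /* nesting depth for the owning thread */
};

static void sketch_rmutex_lock(struct sketch_rmutex *m)
{
        int self = thread_get_id();

        /* Simplified: a real version would load m->owner atomically */
        if (m->owner == self) {
                m->count++;     /* already held by this thread, nest deeper */
                return;
        }
        mutex_lock(&m->lock);
        m->owner = self;
        m->count = 1;
}

static void sketch_rmutex_unlock(struct sketch_rmutex *m)
{
        if (--m->count)
                return;         /* still nested, keep holding the lock */
        m->owner = -1;
        mutex_unlock(&m->lock);
}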

@jforissier (Contributor) commented Jan 16, 2019

OK, thanks for running the benchmark again. I agree that manipulating mutex::state with atomic_cas_ushort() sounds like a good idea for the fast path. But that would be for a future PR I suppose. I'm fine with this for the coming release. Could you perhaps add the performance results to the commit text for the record?

Acked-by: Jerome Forissier <jerome.forissier@linaro.org>

* any value but our thread id.
*/
if (atomic_load_int(&pool->owner) == thread_get_id()) {
if (!refcount_inc(&pool->refc))
@lorc (Contributor):

I think that this code is still prone to a race. You have two atomic operations there, but you need only one.
Suppose that Thread 0 is at line 107 and Thread 1 is at L82. Thread 1 passes the check and thinks that it owns the pool. But Thread 0 resets the pool owner.

I can't see how this can be fixed with atomic variables. Basically, you need to execute the operation "check owner and increase refcount" atomically. But you are doing two atomic operations instead: "check owner" and "increase refcount".

I propose to drop the atomics altogether and introduce a spinlock. With a spinlock you really can operate atomically on owner and refcount.
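
A rough sketch of that spinlock-based proposal, assuming OP-TEE's cpu_spin_lock_xsave()/cpu_spin_unlock_xrestore() helpers; the struct layout and function names are made up, and waiting for a pool owned by another thread is left out:

#include <stdbool.h>
#include <stdint.h>
#include <kernel/spinlock.h>
#include <kernel/thread.h>

struct sketch_spool {
        unsigned int lock;      /* spinlock protecting owner and refc */
        int owner;              /* thread id of the owner, -1 when free */
        unsigned int refc;      /* reference count */
};

static bool sketch_spool_try_get(struct sketch_spool *pool)
{
        bool ok = false;
        uint32_t exceptions = cpu_spin_lock_xsave(&pool->lock);

        /* Owner check and refcount update happen as one atomic step */
        if (pool->owner == thread_get_id() || pool->owner == -1) {
                pool->owner = thread_get_id();
                pool->refc++;
                ok = true;
        }
        cpu_spin_unlock_xrestore(&pool->lock, exceptions);

        /* On false the pool is owned by another thread; the caller would
           have to wait and retry (omitted in this sketch) */
        return ok;
}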

@jenswi-linaro (Contributor, author):

Only the thread owning the pool may set it to free. The case you're describing is not supposed to happen; it's like unlocking a mutex from the wrong thread. The code doesn't depend on refcount_inc() and refcount_dec() being atomic, it just happens to be a convenient interface.
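
A condensed sketch of that invariant (names, the free-owner marker and the include paths are approximations, not the exact PR code): the only transition of owner back to free happens in the put path, and that path runs only on the owning thread.

#include <atomic.h>
#include <kernel/refcount.h>
#include <kernel/thread.h>

#define SKETCH_OWNER_NONE       -1      /* stand-in for "no owner" */

struct sketch_pool {
        struct refcount refc;
        int owner;
};

/* Called only by the thread whose id is currently stored in pool->owner */
static void sketch_put_pool(struct sketch_pool *pool)
{
        if (refcount_dec(&pool->refc)) {
                /*
                 * Last reference dropped: only here, and only by the owning
                 * thread, is owner reset. So once get_pool() has observed
                 * owner == thread_get_id(), no other thread can change it.
                 */
                atomic_store_int(&pool->owner, SKETCH_OWNER_NONE);
        }
}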

@lorc (Contributor):

Ah yes, you are right.

@jenswi-linaro (Contributor, author)

Updated the commit message and added the tag. Please hold off on the merge a bit so we can see whether @lorc agrees.

@jforissier (Contributor)

@jenswi-linaro of course.

@lorc (Contributor) commented Jan 16, 2019

Reviewed-by: Volodymyr Babchuk <vlad.babchuk@gmail.com>

Commit 1: Adds atomic_load_int() and atomic_store_int().

Reviewed-by: Volodymyr Babchuk <vlad.babchuk@gmail.com>
Acked-by: Jerome Forissier <jerome.forissier@linaro.org>
Signed-off-by: Jens Wiklander <jens.wiklander@linaro.org>

Commit 2: Fixes a race in get_pool() which could leave the pool with zero references
but still owned by the last thread using the pool.

Some performance numbers on Hikey with the default configuration:
github/master (edbb89f, before this commit):

4006  real      1m 41.11s
4007  real      1m 14.51s
4008  real      0m 0.13s
4009  real      1m 5.68s

Revert "mempool: optimize reference counting", before this commit:
4006  real      3m 27.78s
4007  real      0m 50.03s
4008  real      0m 0.13s
4009  real      2m 24.07s

With this commit, two runs:
4006  real      1m 37.51s
4007  real      0m 56.67s
4008  real      0m 0.09s
4009  real      1m 3.18s

4006  real      1m 37.61s
4007  real      0m 35.32s
4008  real      0m 0.13s
4009  real      1m 3.15s

Numbers are gathered with this script:
for a in 4006 4007 4008 4009 ; do \
echo -n $a " " >> time.txt ;\
time -o time.txt.tmp xtest -l 15 $a || break ;\
grep real time.txt.tmp >> time.txt
done
cat time.txt

Reviewed-by: Volodymyr Babchuk <vlad.babchuk@gmail.com>
Acked-by: Jerome Forissier <jerome.forissier@linaro.org>
Signed-off-by: Jens Wiklander <jens.wiklander@linaro.org>
@jenswi-linaro (Contributor, author)

Tag applied.

@jforissier merged commit 60b3990 into OP-TEE:master Jan 16, 2019
@jenswi-linaro deleted the mempool_fix branch January 16, 2019 14:52