Bug #1201
closedust_fd close() causes deadlock for any multithreaded application
100%
Description
Hi,
I discovered today an even more serious problem with the current implementation of close/flose().
It does not neither handle the thread cancellation nor make the mutex robustness.
Which means if any of the thread of a multiple threads application got cancelled during calling close()/fclose(), the cancelled thread will terminated leaving the mutex at a persistent lock state.
Any call of close()/fclose() later at any other thread will be blocked forever.
What makes this really serious is because the scenario is quite simple, just thread cancellation during close then it block all other close() forever.
We need to either:
- Disable the thread cancellation at start of lttng_ust_safe_close_fd and re-enable it later.
- Setup the thread cancellation handler and do the mutex if needed.
- Set the mutex as PTHREAD_MUTEX_ROBUST, then check for return code and do pthread_mutex_consistent when needed.
Reproducer:
LD_PRELOAD=/usr/local/lib/liblttng-ust-fd.so ./ust_fd_mutex_battle
/Tai
Files
Updated by Mathieu Desnoyers over 5 years ago
- File fix-1201-1.diff fix-1201-1.diff added
The attached patch implements the first approach (disable pthread cancellation around lock). Does it help ?
Updated by Mathieu Desnoyers over 5 years ago
The patch appears to fix the issue with the reproducer here.
Updated by Tai Dinh over 5 years ago
Mathieu Desnoyers wrote:
The attached patch implements the first approach (disable pthread cancellation around lock). Does it help ?
Yes, I'll try it now.
Thank Mathieu a lot for your quick solution.
Anyway, are you considering other option as well or you think that this has more advantage compared to other alternatives.
/Tai
Updated by Tai Dinh over 5 years ago
Tai Dinh wrote:
Mathieu Desnoyers wrote:
The attached patch implements the first approach (disable pthread cancellation around lock). Does it help ?
Yes, I'll try it now.
Thank Mathieu a lot for your quick solution.Anyway, are you considering other option as well or you think that this has more advantage compared to other alternatives.
/Tai
It works for me as well.
/Tai
Updated by Mathieu Desnoyers over 5 years ago
- Status changed from New to Resolved
- % Done changed from 0 to 100
Applied in changeset lttng-ust|283f4bec47781f7a39dde82d02e53add02155f9c.