threads - Perl interpreter-based threads |
threads - Perl interpreter-based threads
This document describes threads version 1.69
use threads ('yield', 'stack_size' => 64*4096, 'exit' => 'threads_only', 'stringify');
sub start_thread { my @args = @_; print('Thread started: ', join(' ', @args), "\n"); } my $thr = threads->create('start_thread', 'argument'); $thr->join();
threads->create(sub { print("I am a thread\n"); })->join();
my $thr2 = async { foreach (@files) { ... } }; $thr2->join(); if (my $err = $thr2->error()) { warn("Thread error: $err\n"); }
# Invoke thread in list context (implicit) so it can return a list my ($thr) = threads->create(sub { return (qw/a b c/); }); # or specify list context explicitly my $thr = threads->create({'context' => 'list'}, sub { return (qw/a b c/); }); my @results = $thr->join();
$thr->detach();
# Get a thread's object $thr = threads->self(); $thr = threads->object($tid);
# Get a thread's ID $tid = threads->tid(); $tid = $thr->tid(); $tid = "$thr";
# Give other threads a chance to run threads->yield(); yield();
# Lists of non-detached threads my @threads = threads->list(); my $thread_count = threads->list();
my @running = threads->list(threads::running); my @joinable = threads->list(threads::joinable);
# Test thread objects if ($thr1 == $thr2) { ... }
# Manage thread stack size $stack_size = threads->get_stack_size(); $old_size = threads->set_stack_size(32*4096);
# Create a thread with a specific context and stack size my $thr = threads->create({ 'context' => 'list', 'stack_size' => 32*4096, 'exit' => 'thread_only' }, \&foo);
# Get thread's context my $wantarray = $thr->wantarray();
# Check thread's state if ($thr->is_running()) { sleep(1); } if ($thr->is_joinable()) { $thr->join(); }
# Send a signal to a thread $thr->kill('SIGUSR1');
# Exit a thread threads->exit();
Perl 5.6 introduced something called interpreter threads. Interpreter threads are different from 5005threads (the thread model of Perl 5.005) by creating a new Perl interpreter per thread, and not sharing any data or state between threads by default.
Prior to Perl 5.8, this has only been available to people embedding Perl, and
for emulating fork()
on Windows.
The threads API is loosely based on the old Thread.pm API. It is very important to note that variables are not shared between threads, all variables are by default thread local. To use shared variables one must also use the threads::shared manpage:
use threads; use threads::shared;
It is also important to note that you must enable threads by doing use
threads
as early as possible in the script itself, and that it is not
possible to enable threading inside an eval ""
, do
, require
, or
use
. In particular, if you are intending to share variables with
the threads::shared manpage, you must use threads
before you use threads::shared
.
(threads
will emit a warning if you do it the other way around.)
undef
if thread creation failed.
FUNCTION may either be the name of a function, an anonymous subroutine, or a code ref.
my $thr = threads->create('func_name', ...); # or my $thr = threads->create(sub { ... }, ...); # or my $thr = threads->create(\&func, ...);
The ->new()
method is an alias for ->create()
.
join()
->join()
will return the return value(s)
of the
entry point function.
The context (void, scalar or list) for the return value(s)
for ->join()
is determined at the time of thread creation.
# Create thread in list context (implicit) my ($thr1) = threads->create(sub { my @results = qw(a b c); return (@results); }); # or (explicit) my $thr1 = threads->create({'context' => 'list'}, sub { my @results = qw(a b c); return (@results); }); # Retrieve list results from thread my @res1 = $thr1->join();
# Create thread in scalar context (implicit) my $thr2 = threads->create(sub { my $result = 42; return ($result); }); # Retrieve scalar result from thread my $res2 = $thr2->join();
# Create a thread in void context (explicit) my $thr3 = threads->create({'void' => 1}, sub { print("Hello, world\n"); }); # Join the thread in void context (i.e., no return value) $thr3->join();
See THREAD CONTEXT for more details.
If the program exits without all threads having either been joined or detached, then a warning will be issued.
Calling ->join()
or ->detach()
on an already joined thread will
cause an error to be thrown.
detach()
If the program exits without all threads having either been joined or detached, then a warning will be issued.
Calling ->join()
or ->detach()
on an already detached thread
will cause an error to be thrown.
detach()
self()
tid()
tid()
stringify
import option to your use threads
declaration,
then using a threads object in a string or a string context (e.g., as a hash
key) will cause its ID to be used as the value:
use threads qw(stringify);
my $thr = threads->create(...); print("Thread $thr started...\n"); # Prints out: Thread 1 started...
object($tid)
undef
if there is no thread
associated with the TID, if the thread is joined or detached, if no TID is
specified or if the specified TID is undef.
yield()
You may do use threads qw(yield)
, and then just use yield()
in your
code.
list()
list(threads::all)
list(threads::running)
list(threads::joinable)
threads::all
) and in a list context, returns a
list of all non-joined, non-detached threads objects. In a scalar context,
returns a count of the same.
With a true argument (using threads::running
), returns a list of all
non-joined, non-detached threads objects that are still running.
With a false argument (using threads::joinable
), returns a list of all
non-joined, non-detached threads objects that have finished running (i.e.,
for which ->join()
will not block).
equal($thr2)
if ($thr1 == $thr2) { print("Threads are the same\n"); } # or if ($thr1 != $thr2) { print("Threads differ\n"); }
(Thread comparison is based on thread IDs.)
async
creates a thread to execute the block immediately following
it. This block is treated as an anonymous subroutine, and so must have a
semicolon after the closing brace. Like threads->create()
, async
returns a threads object.
error()
eval
context. This method will return undef
if the thread terminates normally. Otherwise, it returns the value of
$@
associated with the thread's execution status in its eval
context.
_handle()
HANDLE
value returned by CreateThread
(i.e., HANDLE *
); for other
platforms, it is a pointer to the pthread_t
structure used in the
pthread_create
call (i.e., pthread_t *
).
This method is of no use for general Perl threads programming. Its intent is to provide other (XS-based) thread modules with the capability to access, and possibly manipulate, the underlying thread structure associated with a Perl thread.
_handle()
The usual method for terminating a thread is to return() from the entry point function with the appropriate return value(s).
exit()
threads->exit()
. This will cause the thread to return undef
in a
scalar context, or the empty list in a list context.
When called from the main thread, this behaves the same as exit(0)
.
exit(status)
threads->exit()
(i.e., the
exit status code is ignored).
When called from the main thread, this behaves the same as exit(status)
.
die()
die()
in a thread indicates an abnormal exit for the thread. Any
$SIG{__DIE__}
handler in the thread will be called first, and then the
thread will exit with a warning message that will contain any arguments passed
in the die()
call.
exit(status)
exit()
inside
threaded code, or in modules that might be used in threaded applications, is
strongly discouraged.
If exit()
really is needed, then consider using the following:
threads->exit() if threads->can('exit'); # Thread friendly exit(status);
exit()
inside a
thread, and effectively causes such calls to behave the same as
threads->exit()
. In other words, with this setting, calling exit()
causes only the thread to terminate.
Because of its global effect, this setting should not be used inside modules or the like.
The main thread is unaffected by this setting.
exit()
inside the newly created
thread only.
set_thread_exit_only(boolean)
exit()
will cause only the
thread to exit. With a false argument, exit()
will terminate the
application.
The main thread is unaffected by this call.
set_thread_exit_only(boolean)
exit()
.
The main thread is unaffected by this call.
The following boolean methods are useful in determining the state of a thread.
is_running()
is_joinable()
$thr->join()
will not block.
is_detached()
is_detached()
As with subroutines, the type of value returned from a thread's entry point
function may be determined by the thread's context: list, scalar or void.
The thread's context is determined at thread creation. This is necessary so
that the context is available to the entry point function via
wantarray(). The thread may then specify a value of
the appropriate type to be returned from ->join()
.
Because thread creation and thread joining may occur in different contexts, it
may be desirable to state the context explicitly to the thread's entry point
function. This may be done by calling ->create()
with a hash reference
as the first argument:
my $thr = threads->create({'context' => 'list'}, \&foo); ... my @results = $thr->join();
In the above, the threads object is returned to the parent thread in scalar
context, and the thread's entry point function foo
will be called in list
(array) context such that the parent thread can receive a list (array) from
the ->join()
call. ('array'
is synonymous with 'list'
.)
Similarly, if you need the threads object, but your thread will not be returning a value (i.e., void context), you would do the following:
my $thr = threads->create({'context' => 'void'}, \&foo); ... $thr->join();
The context type may also be used as the key in the hash reference followed by a true value:
threads->create({'scalar' => 1}, \&foo); ... my ($thr) = threads->list(); my $result = $thr->join();
If not explicitly stated, the thread's context is implied from the context
of the ->create()
call:
# Create thread in list context my ($thr) = threads->create(...);
# Create thread in scalar context my $thr = threads->create(...);
# Create thread in void context threads->create(...);
wantarray()
This returns the thread's context in the same manner as wantarray().
wantarray()
Class method to return the current thread's context. This returns the same value as running wantarray() inside the current thread's entry point function.
The default per-thread stack size for different platforms varies significantly, and is almost always far more than is needed for most applications. On Win32, Perl's makefile explicitly sets the default stack to 16 MB; on most other platforms, the system default is used, which again may be much larger than is needed.
By tuning the stack size to more accurately reflect your application's needs, you may significantly reduce your application's memory usage, and increase the number of simultaneously running threads.
Note that on Windows, address space allocation granularity is 64 KB, therefore, setting the stack smaller than that on Win32 Perl will not save any more memory.
Some platforms have a minimum thread stack size. Trying to set the stack size below this value will result in a warning, and the minimum stack size will be used.
Some Linux platforms have a maximum stack size. Setting too large of a stack size will cause thread creation to fail.
If needed, $new_size
will be rounded up to the next multiple of the memory
page size (usually 4096 or 8192).
Threads created after the stack size is set will then either call
pthread_attr_setstacksize()
(for pthreads platforms), or supply the
stack size to CreateThread()
(for Win32 Perl).
(Obviously, this call does not affect any currently extant threads.)
PERL5_ITHREADS_STACK_SIZE
:
PERL5_ITHREADS_STACK_SIZE=1048576 export PERL5_ITHREADS_STACK_SIZE perl -e'use threads; print(threads->get_stack_size(), "\n")'
This value overrides any stack_size
parameter given to use threads
. Its
primary purpose is to permit setting the per-thread stack size for legacy
threaded applications.
->create()
with a hash reference as the first argument:
my $thr = threads->create({'stack_size' => 32*4096}, \&foo, @args);
$thr2
) that inherits the stack size from an
existing thread ($thr1
). This is shorthand for the following:
my $stack_size = $thr1->get_stack_size(); my $thr2 = threads->create({'stack_size' => $stack_size}, FUNCTION, ARGS);
When safe signals is in effect (the default behavior - see Unsafe signals for more details), then signals may be sent and acted upon by individual threads.
->kill()
.
Returns the thread object to allow for method chaining:
$thr->kill('SIG...')->join();
Signal handlers need to be set up in the threads for the signals they are expected to act upon. Here's an example for cancelling a thread:
use threads;
sub thr_func { # Thread 'cancellation' signal handler $SIG{'KILL'} = sub { threads->exit(); };
... }
# Create a thread my $thr = threads->create('thr_func');
...
# Signal the thread to terminate, and then detach # it so that it will get cleaned up automatically $thr->kill('KILL')->detach();
Here's another simplistic example that illustrates the use of thread signalling in conjunction with a semaphore to provide rudimentary suspend and resume capabilities:
use threads; use Thread::Semaphore;
sub thr_func { my $sema = shift;
# Thread 'suspend/resume' signal handler $SIG{'STOP'} = sub { $sema->down(); # Thread suspended $sema->up(); # Thread resumes };
... }
# Create a semaphore and pass it to a thread my $sema = Thread::Semaphore->new(); my $thr = threads->create('thr_func', $sema);
# Suspend the thread $sema->down(); $thr->kill('STOP');
...
# Allow the thread to continue $sema->up();
CAVEAT: The thread signalling capability provided by this module does not
actually send signals via the OS. It emulates signals at the Perl-level
such that signal handlers are called in the appropriate thread. For example,
sending $thr->kill('STOP')
does not actually suspend a thread (or the
whole process), but does cause a $SIG{'STOP'}
handler to be called in that
thread (as illustrated above).
As such, signals that would normally not be appropriate to use in the
kill()
command (e.g., kill('KILL', $$)
) are okay to use with the
->kill()
method (again, as illustrated above).
Correspondingly, sending a signal to a thread does not disrupt the operation the thread is currently working on: The signal will be acted upon after the current operation has completed. For instance, if the thread is stuck on an I/O call, sending it a signal will not cause the I/O call to be interrupted such that the signal is acted up immediately.
Sending a signal to a terminated thread is ignored.
NOTE: If the main thread exits, then this warning cannot be suppressed
using no warnings 'threads';
as suggested below.
pthread_create
to determine the actual
cause for the failure.
threads->exit()
. For example, the thread
may have terminated because of an error, or by using die
.
If needed, thread warnings can be suppressed by using:
no warnings 'threads';
in the appropriate scope.
useithreads
configuration option.
Having threads support requires all of Perl and all of the XS modules in the Perl installation to be rebuilt; it is not just a question of adding the the threads manpage module (i.e., threaded and non-threaded Perls are binary incompatible.)
$thr->set_stack_size($size);
->kill()
signalling method.
See Unsafe signals for more details.
->kill()
call.
Before you consider posting a bug report, please consult, and possibly post a message to the discussion forum to see if what you've encountered is a known problem.
If the module will only be used inside a thread, you can try loading the
module from inside the thread entry point function using require
(and
import
if needed):
sub thr_func { require Unsafe::Module # Unsafe::Module->import(...);
.... }
If the module is needed inside the main thread, try modifying your
application so that the module is loaded (again using require
and
->import()
) after any threads are started, and in such a way that no
other threads are started afterwards.
If the above does not work, or is not adequate for your application, then file a bug report on http://rt.cpan.org/Public/ against the problematic module.
chdir()
) will affect all the threads in the application.
On MSWin32, each thread maintains its own the current working directory setting.
system()
or back-ticks) made from threads use the environment variable
settings from the main thread. In other words, changes made to %ENV
in
a thread will not be visible in system calls made by that thread.
To work around this, set environment variables as part of the system call. For example:
my $msg = 'hello'; system("FOO=$msg; echo \$FOO"); # Outputs 'hello' to STDOUT
On MSWin32, each thread maintains its own set of environment variables.
BEGIN
, CHECK
or INIT
blocks should not be
relied upon. Depending on the Perl version and the application code, results
may range from success, to (apparently harmless) warnings of leaked scalar, or
all the way up to crashing of the Perl interpreter.
Safe signals is the default behavior, and the old, immediate, unsafe signalling behavior is only in effect in the following situations:
PERL_OLD_SIGNALS
(see perl -V
).PERL_SIGNALS
is set to unsafe
(see PERL_SIGNALS in the perlrun manpage).If unsafe signals is in effect, then signal handling is not thread-safe, and
the ->kill()
signalling method cannot be used.
Even with the latest version of Perl, it is known that certain constructs with threads may result in warning messages concerning leaked scalars or unreferenced scalars. However, such warnings are harmless, and may safely be ignored.
You can search for the threads manpage related bug reports at http://rt.cpan.org/Public/. If needed submit any new bugs, problems, patches, etc. to: http://rt.cpan.org/Public/Dist/Display.html
Perl 5.8.0 or later
the threads manpage Discussion Forum on CPAN: http://www.cpanforum.com/dist/threads
Annotated POD for the threads manpage: http://annocpan.org/~JDHEDDEN/threads-1.69/threads.pm
Source repository: http://code.google.com/p/threads-shared/
the threads::shared manpage, the perlthrtut manpage
http://www.perl.com/pub/a/2002/06/11/threads.html and http://www.perl.com/pub/a/2002/09/04/threads.html
Perl threads mailing list: http://lists.cpan.org/showlist.cgi
Stack size discussion: http://www.perlmonks.org/
Artur Bergman <sky AT crucially DOT net>
threads is released under the same license as Perl.
CPAN version produced by Jerry D. Hedden <jdhedden AT cpan DOT org>
Richard Soderberg <perl AT crystalflame DOT net> - Helping me out tons, trying to find reasons for races and other weird bugs!
Simon Cozens <simon AT brecon DOT co DOT uk> - Being there to answer zillions of annoying questions
Rocco Caputo <troc AT netrus DOT net>
Vipul Ved Prakash <mail AT vipul DOT net> - Helping with debugging
Dean Arnold <darnold AT presicient DOT com> - Stack size API
threads - Perl interpreter-based threads |