path: root/lib
Age | Commit message | Author
2010-06-10 | InputChannelTask now has an interface for properly handling reads from the same and from different pools | Sebastian Thiel
2010-06-10 | messy first version of a properly working depth-first graph method, which allows the pool to work as expected. Many more tests need to be added, and there is still a problem with shutdown, as sometimes it won't kill all threads, mainly because the process came up with worker threads started, which cannot be | Sebastian Thiel
2010-06-09 | test: prepared task dependency test, which already helped to find a bug in the reference counting mechanism that caused references to the pool to be kept via cycles | Sebastian Thiel
2010-06-09 | task: redesigned write channel access to allow the task creator to set their own write channels, possibly some with callbacks installed etc. Pool.add_task now respects the user's choice, but provides defaults which are optimized for performance | Sebastian Thiel
2010-06-09 | Channel: Callbacks reviewed - they are now part of subclasses of the default channel implementation, one of which is used as the base of the Pool Read channel, relieving it of the duty to call these itself. The write channel subclass with callbacks allows the transformation of the item to be written | Sebastian Thiel
2010-06-09 | task: removed scheduled task support, which at some point was introduced to improve performance, but which now hinders performance, besides being unnecessary ;) | Sebastian Thiel
2010-06-09 | Channel: removed pseudoconstructor, which clearly improves the design and makes it easier to customize. pool: in serial mode, created channels will be serial-only, which brings 15% more performance | Sebastian Thiel
2010-06-09 | Channel: Read method revised - now it really, really doesn't block anymore, and it runs faster as well, at about 2/3 of the performance we have in serial mode | Sebastian Thiel
2010-06-09 | HSCondition: Fixed terrible bug which it inherited from the default python Condition implementation, related to the notify method not being thread-safe. Although I was aware of it, I missed the first check which tests for the size - the result could be incorrect if the whole method wasn't locked. Testing runs stable now, allowing to move on! | Sebastian Thiel
2010-06-09 | HSCondition: now gets a lock even in the single-notify case, as it was required due to the non-atomicity of the involved operation. Removed one level of indirection for the lock by refraining from calling my own 'wrapper' methods, which brought it back to the performance it had before the locking was introduced for the n==1 case | Sebastian Thiel
2010-06-09 | HSCondition: now deriving from deque, as the AsyncQueue does, to eliminate one more level of indirection. Clearly this is not good from a design standpoint, as a Condition is no deque, but it helps speed things up, which is what this is about. Could make it a hidden class to indicate how 'special' it is | Sebastian Thiel
2010-06-09 | HSCondition: using a deque to store waiters, for further speedup | Sebastian Thiel
2010-06-09 | thread: fixed initialization problem if an empty iterable was handed in. queue: Queue now derives from deque directly, which saves one dict lookup as the queue does not need to be accessed through self anymore; pool test improved to better verify that threads are started correctly | Sebastian Thiel
2010-06-09 | queue: fixed critical bug in the notify method, as it was not at all thread-safe, causing locks to be released multiple times. Now it runs very fast, and very stable apparently. Now it's about putting previous features back in, and studying their results, before more complex task graphs can be examined | Sebastian Thiel
2010-06-08 | workerthread: adjusted to use a blocking queue, it will receive termination events only through its queue, which boosts performance into bright green levels | Sebastian Thiel
2010-06-08 | Revised task deletion works well. Adjusted the test to create new tasks all the time instead of reusing its own one, as it was somewhat hard to manage its state over time and could cause bugs. It works okay, but it occasionally hangs; it appears to be an empty queue. Have to gradually put certain things back in, although in the current mode of operation, it should never have empty queues from the pool to the user | Sebastian Thiel
2010-06-08 | task: now deletes itself once it's done - for the test this doesn't change a thing as the task deletes itself too late - it's time for a paradigm change, the task should be deleted with its RPoolChannel or explicitly by the user. The test needs to adapt, and shouldn't assume anything unless the RPoolChannel is gone | Sebastian Thiel
2010-06-08 | It's getting better already - intermediate commit before further changing the task class | Sebastian Thiel
2010-06-08 | queue: adjusted queue to be closable (without own testing yet, except for the pool which runs it) - it's not yet stable, but should be solvable. | Sebastian Thiel
2010-06-08 | The new channel design actually works, but it also shows that it's located at the wrong spot. The channel is nothing more than an adapter allowing to read multiple items from a thread-safe queue, the queue itself though must be 'closable' for writing, or needs something like a writable flag. | Sebastian Thiel
2010-06-08 | both versions of the async queue still have trouble in certain situations, at least with my totally overwritten version of the condition - the previous one was somewhat more stable, it seems. Nonetheless, this is the fastest version so far | Sebastian Thiel
2010-06-08 | test implementation of async-queue with everything stripped from it that didn't seem necessary - it's a failure, something is wrong - performance is not much better than the original one, it actually depends on the condition performance, which I don't get faster | Sebastian Thiel
2010-06-07 | Task scheduled items lock now uses a dummy lock in serial mode, improving its performance considerably. Channels now use the AsyncQueue, boosting their throughput to about 5k items / s - this is something one can work with, considering the runtime of each item should be large enough to keep the threads busy. This could be a basis, further testing needed | Sebastian Thiel
2010-06-07 | Channel now uses the AsyncQueue, boosting performance by a factor of 4, it's a start | Sebastian Thiel
2010-06-07 | introduced a new counter keeping track of the scheduled tasks - this prevents unnecessary tasks from being scheduled, as we keep track of how many items will be produced for the task at hand. This introduces additional locking, but performs well in multithreaded mode. Performance of the master queue is still a huge issue, it's currently the limiting factor, as bypassing the master queue in serial mode gives 15x performance, which is what I would need | Sebastian Thiel
2010-06-07 | improved testing to test the actual async handling of the pool. There are still inconsistencies that need to be fixed, but it already improved, especially the 4-thread performance, which now is as fast as the dual-threaded performance | Sebastian Thiel
2010-06-07 | task: Fixed incorrect handling of channel closure. Performance is alright for up to 2 threads, but 4 are killing the queue | Sebastian Thiel
2010-06-07 | Moved pool utilities into util module, fixed critical issue that caused havoc - let's call this a safe-state | Sebastian Thiel
2010-06-07 | added high-speed locking facilities, allowing our Queue to be faster, at least in tests, and with multiple threads. There is still a sync bug in regard to closed channels to be fixed, as the Task.set_done handling is incorrect | Sebastian Thiel
2010-06-07 | Added task order cache, and a lock to prevent us walking the graph while changing tasks. Now processing more items to test performance, in dual-threaded mode as well, and it's rather bad, have to figure out the reason for this, probably the GIL, but queues could help | Sebastian Thiel
2010-06-07 | changed scheduling and chunksize calculation with respect to task.min_count, to fix a theoretical possibility of a deadlock in serial mode, and unnecessary blocking in async mode | Sebastian Thiel
2010-06-07 | pool.consumed_tasks: is now a queue to be thread-safe, in preparation for multiple connected pools. Reduced waiting time in tests to make them complete faster | Sebastian Thiel
2010-06-07 | pool: First version which works as expected in async mode. It's just using a single task for now, but next up are dependent tasks | Sebastian Thiel
2010-06-06 | channel.read: enhanced to be sure we don't run into non-atomicity issues related to our channel closed flag, which is the only way not to block forever on read(0) channels which were closed by a thread 'in the meanwhile' | Sebastian Thiel
2010-06-06 | Plenty of fixes in the chunking routine, made possible by a serialized chunking test. Next up, actual async processing | Sebastian Thiel
2010-06-06 | First step of testing the pool - tasks have been separated into a new module including own tests, their design improved to prepare them for some specifics that would be needed for multiprocessing support | Sebastian Thiel
2010-06-06 | thread: adjusted worker thread not to provide an output queue anymore - this is handled by the task system. graph: implemented it including test according to the pool's requirements. pool: implemented set_pool_size | Sebastian Thiel
2010-06-06 | Improved pool design and started rough implementation, top down to learn while going. Tests will be written soon for verification, it's still quite theoretical | Sebastian Thiel
2010-06-05 | Renamed mp to async, as this is a much better name for what is actually going on. The default implementation uses threads, which ends up being nothing more than async, as they are all locked down by internal locks and the global interpreter lock | Sebastian Thiel
2010-06-05 | Moved multiprocessing modules into their own package, as they in fact have nothing to do with the object db. If that really works the way I want, it will become its own project, called async | Sebastian Thiel
2010-06-05 | Initial pool design added, allowing for lazy channel-based evaluation of inter-dependent tasks | Sebastian Thiel
2010-06-05 | A code donation: Donating a worker thread implementation including tests to Git-Python. I have the feeling it can do much good here :) | Sebastian Thiel
2010-06-05 | Added basic channel implementation including test. Restructured odb tests, they are now in their own module to keep the modules small | Sebastian Thiel
2010-06-05 | Removed compression flag from IStream and OStream types, as a valid object will always be compressed if generated by the system (even future memory db's will compress it). loose db: implemented direct stream copy, indicated by a sha set in the IStream, including test. This will be the case once Packs are exploded, for instance | Sebastian Thiel
2010-06-04 | Implemented stream tests, found a bug on the way, slowly a test-framework for streams starts to show up, but it's not yet there | Sebastian Thiel
2010-06-04 | Merge branch 'odb'. Conflicts: lib/git/cmd.py | Sebastian Thiel
2010-06-04 | Fixed implementation after design change to deal with it - all tests run, but next there will have to be more thorough testing | Sebastian Thiel
2010-06-04 | initial version of new odb design to facilitate a channel-based multi-threading implementation of all odb functions | Sebastian Thiel
2010-06-04 | db: implemented GitObjectDB using the git command to make sure we can look up everything. Next is to implement pack-file reading, then alternates, which should allow to resolve everything | Sebastian Thiel
2010-06-03 | Fixed compatibility issues with python 2.5, made sure all tests run | Sebastian Thiel