GitPython - Forked from https://github.com/gitpython-developers/GitPython

Age	Commit message (Collapse)	Author
2010-06-08	Its getting better already - intermediate commit before further chaning the ↵	Sebastian Thiel
	task class
2010-06-08	queue: adjusted queue to be closable ( without own testing yet, except for ↵	Sebastian Thiel
	the pool which runs it ) - its not yet stable, but should be solvable.
2010-06-08	The new channeldesign actually works, but it also shows that its located at ↵	Sebastian Thiel
	the wrong spot. The channel is nothing more than an adapter allowing to read multiple items from a thread-safe queue, the queue itself though must be 'closable' for writing, or needs something like a writable flag.
2010-06-08	both versions of the async queue still have trouble in certain situations, ↵	Sebastian Thiel
	at least with my totally overwritten version of the condition - the previous one was somewhat more stable it seems. Nonetheless, this is the fastest version so far
2010-06-08	test implementation of async-queue with everything stripped from it that ↵	Sebastian Thiel
	didn't seem necessary - its a failure, something is wrong - performance not much better than the original one, its depending on the condition performance actually, which I don't get faster
2010-06-07	Task scheduled items lock now uses a dummy lock in serial mode, improving ↵	Sebastian Thiel
	its performance considerably. Channels now use the AsyncQueue, boosting their throughput to about 5k items / s - this is something one can work with, considering the runtime of each item should be large enough to keep the threads busy. This could be a basis, further testing needed
2010-06-07	Channel now uses the AsyncQueue, boosting performance by factor 4, its a start	Sebastian Thiel

2010-06-07	introduced a new counter keeping track of the scheduled tasks - this prevent ↵	Sebastian Thiel
	unnecessary tasks to be scheduled as we keep track of how many items will be produced for the task at hand. This introduces additional locking, but performns well in multithreaded mode. Performance of the master queue is still a huge issue, its currently the limiting factor, as bypassing the master queue in serial moode gives 15x performance, wich is what I would need
2010-06-07	improved testing to test the actual async handling of the pool. there are ↵	Sebastian Thiel
	still inconsistencies that need to be fixed, but it already improved, especially the 4-thread performance which now is as fast as the dual-threaded performance
2010-06-07	task: Fixed incorrect handling of channel closure. Performance is alright ↵	Sebastian Thiel
	for up to 2 threads, but 4 are killing the queue
2010-06-07	Moved pool utilities into util module, fixed critical issue that caused ↵	Sebastian Thiel
	havok - lets call this a safe-state
2010-06-07	added high-speed locking facilities, allowing our Queue to be faster, at ↵	Sebastian Thiel
	least in tests, and with multiple threads. There is still an sync bug in regard to closed channels to be fixed, as the Task.set_done handling is incorrecft
2010-06-07	Added task order cache, and a lock to prevent us walking the graph while ↵	Sebastian Thiel
	changing tasks Now processing more items to test performance, in dual-threaded mode as well, and its rather bad, have to figure out the reason for this, probably gil, but queues could help
2010-06-07	changed scheduling and chunksize calculation in respect to the ↵	Sebastian Thiel
	task.min_count, to fix theoretical option for a deadlock in serial mode, and unnecessary blocking in async mode
2010-06-07	pool.consumed_tasks: is now a queue to be thread safe, in preparation for ↵	Sebastian Thiel
	multiple connected pools Reduced waiting time in tests to make them complete faster
2010-06-07	pool: First version which works as expected in async mode. Its just using a ↵	Sebastian Thiel
	single task for now, but next up are dependent tasks
2010-06-06	channel.read: enhanced to be sure we don't run into non-atomicity issues ↵	Sebastian Thiel
	related to our channel closed flag, which is the only way not to block forever on read(0) channels which were closed by a thread 'in the meanwhile'
2010-06-06	Plenty of fixes in the chunking routine, made possible by a serialized ↵	Sebastian Thiel
	chunking test. Next up, actual async processing
2010-06-06	First step of testing the pool - tasks have been separated into a new module ↵	Sebastian Thiel
	including own tests, their design improved to prepare them for some specifics that would be needed for multiprocessing support
2010-06-06	thread: adjusted worker thread not to provide an output queue anymore - this ↵	Sebastian Thiel
	is handled by the task system graph: implemented it including test according to the pools requirements pool: implemented set_pool_size
2010-06-06	Improved pool design and started rough implementation, top down to learn ↵	Sebastian Thiel
	while going. Tests will be written soon for verification, its still quite theoretical
2010-06-05	Renamed mp to async, as this is a much better name for what is actually ↵	Sebastian Thiel
	going on. The default implementation uses threads, which ends up being nothing more than async, as they are all locked down by internal and the global interpreter lock
2010-06-05	Moved multiprocessing modules into own package, as they in fact have nothing ↵	Sebastian Thiel
	to do with the object db. If that really works the way I want, it will become an own project, called async
2010-06-05	Initial pool design added, allowing for lazy channel based evaluation of ↵	Sebastian Thiel
	inter-dependent tasks
2010-06-05	A code donation: Donating a worker thread implementation inclduding tests to ↵	Sebastian Thiel
	Git-Python. I have the feeling it can do much good here :)
2010-06-05	Added basic channel implementation including test	Sebastian Thiel
	restructured odb tests, they are now in an own module to keep the modules small
2010-06-05	Removed compression flag from IStream and OStream types, as a valid object ↵	Sebastian Thiel
	will always be compressed if generated by the system ( even future memory db's will compress it ) loose db: implemented direct stream copy, indicated by a sha set in the IStream, including test. This will be the case once Packs are exploded for instance
2010-06-04	Implemented stream tests, found a bug on the way, slowly a test-framework ↵	Sebastian Thiel
	for streams starts to show up, but its not yet there
2010-06-04	Merge branch 'odb'	Sebastian Thiel
	Conflicts: lib/git/cmd.py
2010-06-04	Fixed implementation after design change to deal with it - all tests run, ↵	Sebastian Thiel
	but next there will have to be more through testing
2010-06-04	initial version of new odb design to facilitate a channel based ↵	Sebastian Thiel
	multi-threading implementation of all odb functions
2010-06-04	db: implemented GitObjectDB using the git command to make sure we can lookup ↵	Sebastian Thiel
	everything. Next is to implement pack-file reading, then alternates which should allow to resolve everything
2010-06-03	Fixed compatability issues with python 2.5, made sure all tests run	Sebastian Thiel

2010-06-03	commit.create_from_tree now uses pure python implementation, fixed message ↵	Sebastian Thiel
	parsing which truncated newlines although it was ilegitimate. Its up to the reader to truncate therse, nowhere in the git code I could find anyone adding newlines to commits where it is written Added performance tests for serialization, it does about 5k commits per second if writing to tmpfs
2010-06-03	Added performance comparison to cgit ... and yes, git-python is faster :)	Sebastian Thiel

2010-06-03	odb: fixed streamed decompression reader ( specific tests would still be ↵	Sebastian Thiel
	missing ) and added performance tests which are extremely promising
2010-06-03	odb: implemented loose object streaming, which is impossible to do ↵	Sebastian Thiel
	efficiently considering that it copies string buffers all the time
2010-06-03	git.cmd: using communicate in the main branch of execution, which might not ↵	Sebastian Thiel
	make a big difference, but perhaps its smarter about broken pipes. Adjusted code to selectively strip terminating newline, only if they are there. The previous code would effectively duplicate the string and strip whitespace from both ends even though there was no need for it. Its a bit faster now as the tests proclaim
2010-06-03	git.cmd: moved hardcoded chunksize when duplicating stream data into ↵	Sebastian Thiel
	easy-to-change class member variable
2010-06-02	added frame for object reading, including simple test	Sebastian Thiel

2010-06-02	initial version of loose object writing and simple cached object lookup ↵	Sebastian Thiel
	appears to be working
2010-06-02	Added first design and frame for object database. In a first step, loose ↵	Sebastian Thiel
	objects will be written using our utilities, and certain object retrieval functionality moves into the GitObjectDatabase which is used by the repo instance Added performance test for object database access, which shows quite respectable tree parsing performance, and okay blob access. Nonetheless, it will be hard to beat the c performance using a pure python implementation, but it can be a nice practice to write it anyway to allow more direct pack manipulations. Some could benefit from the ability to write packs as these can serve as local cache if alternates are used
2010-06-02	git.cmd: added test for stream section constraint used in git command, found ↵	Sebastian Thiel
	bug of course which just didn't kick in yet
2010-06-02	commit: redesigned revlist and commit parsing, commits are always retrieved ↵	Sebastian Thiel
	from their object information directly. This is faster, and resolves issues with the rev-list format and empty commit messages Adjusted many tests to go with the changes, as they were still mocked. The mock was removed if necessary and replaced by code that actually executes
2010-06-02	commit: refactored existing code to decode commits from streams - ↵	Sebastian Thiel
	performance is slightly better git.cmd: added method to provide access to the content stream directly. This is more efficient if large objects are handled, if it is actually used test.helpers: removed unnecessary code
2010-06-02	commit: initial version of commit_from_tree which could create commit ↵	Sebastian Thiel
	objects if it could serialize itself
2010-05-31	gitcmd: may now receive extra keyword arguments to be passed directly to the ↵	Sebastian Thiel
	subproces.Popen invocation. It could be used to pass custom environments, without changing the own one (#26)
2010-05-27	cmd: By default, on linux, the parent file handles will be closed to leave ↵	Sebastian Thiel
	the child less cluttered, and make it easier to debug as it will only have the file descriptors we set. It appears to be more stable regarding the stdin-is-closed-but-child-doesn't-realize-this issue
2010-05-26	index: index-add fixed to always append a newline after each item. In git ↵	Sebastian Thiel
	has unified its way it reads from stdin, now it wants all items to be terminated by a newline usually. Previously, it could have been that it really didn't want to have a termination character when the last item was written to the file. Bumped the minimum requirements to 1.7.0 to be sure it is working as I think it will. Still, I have to admit that sometime it just appears the closed pipe will not stop git from waiting for more input, at least with the previous implementation
2010-05-26	refs: a Reference can now be created by assigning a commit or object (for ↵	Sebastian Thiel
	convenience)