serenity - The Serenity Operating System 🐞

Age	Commit message (Collapse)	Author
2022-12-16	AK: Support popping an arbitrary element from a HashTable	Eli Youngs

2022-12-09	Everywhere: Remove unnecessary AK and Detail namespace scoping	Moustafa Raafat

2022-12-03	Everywhere: Run clang-format	Linus Groh

2022-11-26	AK: Make it possible to not `using` AK classes into the global namespace	Andreas Kling
	This patch adds the `USING_AK_GLOBALLY` macro which is enabled by default, but can be overridden by build flags. This is a step towards integrating Jakt and AK types.
2022-11-11	AK: Don't crash in HashTable::clear_with_capacity on an empty table	Zaggy1024
	When calling clear_with_capacity on an empty HashTable/HashMap, a null deref would occur when trying to memset() m_buckets. Checking that it has capacity before clearing fixes the issue.
2022-06-23	AK: Zero previous pointer after fixing the insertion list in HashTable	Hendiadyoin1

2022-06-22	AK: Clear the previous and next pointers of deleted HashTable buckets	Idan Horowitz
	Usually the values of the previous and next pointers of deleted buckets are never used, as they're not part of the main ordered bucket chain, but if an in-place rehashing is done, which results in the bucket being turned into a free bucket, the stale pointers will remain, at which point any item that is inserted into said free-bucket will have either a stale previous pointer if the HashTable was empty on insertion, or a stale next pointer, resulting in undefined behaviour. This commit also includes a new HashMap test that reproduces this issue
2022-05-08	AK+LibGUI: Pass predicate to *_matching() methods by const reference	Vitaly Dyachkov

2022-04-01	Everywhere: Run clang-format	Idan Horowitz

2022-03-31	AK: Use bucket states with special bit patterns in HashTable	kleines Filmröllchen
	This simplifies some of the bucket state handling code, as there's now an easy way of checking the basic category of bucket state.
2022-03-31	AK: Rehash HashTable in-place instead of shrinking	kleines Filmröllchen
	As seen on TV, HashTable can get "thrashed", i.e. it has a bunch of deleted buckets that count towards the load factor. This means that hash tables which are large enough for their contents need to be resized. This was fixed in 9d8da16 with a workaround that shrinks the HashTable back down in these cases, as after the resize and re-hash the load factor is very low again. However, that's not a good solution. If you insert and remove repeatedly around a size boundary, you might get frequent resizes, which involve frequent re-allocations. The new solution is an in-place rehashing algorithm that I came up with. (Do complain to me, I'm at fault.) Basically, it iterates the buckets and re-hashes the used buckets while marking the deleted slots empty. The issue arises with collisions in the re-hash. For this reason, there are two kinds of used buckets during the re-hashing: the normal "used" buckets, which are old and are treated as free space, and the "re-hashed" buckets, which are new and treated as used space, i.e. they trigger probing. Therefore, the procedure for relocating a bucket's contents is as follows: - Locate the "real" bucket of the contents with the hash. That bucket is the starting point for the target bucket, and the current (old) bucket is the bucket we want to move. - While we still need to move the bucket: - If we're the target, something strange happened last iteration or we just re-hashed to the same location. We're done. - If the target is empty or deleted, just move the bucket. We're done. - If the target is a re-hashed full bucket, we probe by double-hashing our hash as usual. Henceforth, we move our target for the next iteration. - If the target is an old full bucket, we swap the target and to-move buckets. Therefore, the bucket to move is a the correct location and the former target, which still needs to find a new place, is now in the bucket to move. So we can just continue with the loop; the target is re-obtained from the bucket to move. This happens for each and every bucket, though some buckets are "coincidentally" moved before their point of iteration is reached. Either way, this guarantees full in-place movement (even without stack storage) and therefore space complexity of O(1). Time complexity is amortized O(2n) asssuming a good hashing function. This leads to a performance improvement of ~30% on the benchmark introduced with the last commit. Co-authored-by: Hendiadyoin1 <leon.a@serenityos.org>
2022-03-31	AK: Merge HashTable bucket state into one enum	kleines Filmröllchen
	The hash table buckets had three different state booleans that are in fact exclusive. In preparation for further states, this commit consolidates them into one enum. This has the added benefit on not relying on the compiler's boolean packing anymore; we definitely now only need one byte for the bucket state.
2022-03-15	AK+Kernel: Avoid double memory clearing of HashTable buckets	Daniel Bertalan
	Since the allocated memory is going to be zeroed immediately anyway, let's avoid redundantly scrubbing it with MALLOC_SCRUB_BYTE just before that. The latest versions of gcc and Clang can automatically do this malloc + memset -> calloc optimization, but I've seen a couple of places where it failed to be done. This commit also adds a naive kcalloc function to the kernel that doesn't (yet) eliminate the redundancy like the userland does.
2022-03-07	AK: Automatically shrink HashTable when removing entries	Andreas Kling
	If the utilization of a HashTable (size vs capacity) goes below 20%, we'll now shrink the table down to capacity = (size * 2). This fixes an issue where tables would grow infinitely when inserting and removing keys repeatedly. Basically, we would accumulate deleted buckets with nothing reclaiming them, and eventually deciding that we needed to grow the table (because we grow if used+deleted > limit!) I found this because HashTable iteration was taking a suspicious amount of time in Core::EventLoop::get_next_timer_expiration(). Turns out the timer table kept growing in capacity over time. That made iteration slower and slower since HashTable iterators visit every bucket.
2022-03-07	AK: Remove return value from HashTable::remove() and HashMap::remove()	Andreas Kling
	This was only used by remove_all_matching(), where it's no longer used.
2022-03-07	AK: Simplify HashTable::remove_all_matching()	Andreas Kling
	Just walk the table from start to finish, deleting buckets as we go. This removes the need for remove() to return an iterator, which is preventing me from implementing hash table auto-shrinking.
2022-01-29	AK: Support using custom comparison operations for hash compatible keys	Idan Horowitz

2022-01-25	AK: Implement `HashTable::try_ensure_capacity`, as used in `HashMap`	James Puleo
	This was used in `HashMap::try_ensure_capacity`, but was missing from `HashTable`s implementation. No one had used `HashMap::try_ensure_capacity` before so it went unnoticed!
2022-01-05	AK: Make Hash{Map,Table}::remove_all_matching() return removal success	Andreas Kling
	These functions now return whether one or more entries were removed.
2022-01-05	AK: Add HashTable::remove_all_matching(predicate)	Andreas Kling
	This removes all matching entries from a table in a single pass.
2021-12-15	AK: Enable fast path for removal by hash-compatible key in HashMap/Table	Hendiadyoin1

2021-12-15	AK: Allow hash-compatible key types in Hash[Table\|Map] lookup	Hendiadyoin1
	This will allow us to avoid some potentially expensive type conversion during lookup, like form String to StringView, which would allocate memory otherwise.
2021-11-14	AK: Resolve clang-tidy readability-qualified-auto warnings	Andrew Kaster
	... In files included by Kernel/Process.cpp and Kernel/Thread.cpp
2021-11-14	AK: Resolve clang-tidy readability-bool-conversion warnings	Andrew Kaster
	... In files included by Kernel/Process.cpp and Kernel/Thread.cpp
2021-11-11	AK: Allow to clear HashTables/Maps with capacity	Hendiadyoin1

2021-11-11	AK: Make HashTable and HashMap try_* functions return ErrorOr<T>	Andreas Kling
	This allows us to use TRY() and MUST() with them.
2021-10-06	AK: Add missing headers	Ben Wiederhake
	Example failure: IDAllocator.h only pulls in AK/Hashtable.h, so any compilation unit that includes AK/IDAllocator.h without including AK/Traits.h before it used to be doomed to fail with the cryptic error message "In instantiation of 'AK::HashTable<T, TraitsForT, IsOrdered>::Iterator AK::HashTable<T, TraitsForT, IsOrdered>::find(const T&) [with T = int; TraitsForT = AK::Traits: incomplete type 'AK::Traits<int>' used in nested name specifier".
2021-09-10	AK: Mark HashTable::size_in_bytes() as constexpr	Hendiadyoin1

2021-09-10	AK: Add OOM safe interface to HashTable/Map	Hediadyoin1
	This adds a new HashSetResult only returned by try_set, to signal allocation failure during setting.
2021-09-07	Everywhere: Behaviour => Behavior	Andreas Kling

2021-07-21	AK: Remove unused private HashTable::lookup_for_reading()	Andreas Kling

2021-07-21	AK: Sprinkle [[nodiscard]] on HashMap and HashTable	Andreas Kling

2021-07-13	HashTable: Rename finders with a more accurate and self-descripting name	ngc6302h

2021-07-11	AK: Use kfree_sized() in AK::HashTable	Andreas Kling

2021-06-15	AK: Add Ordering support to HashTable and HashMap	Hediadyoin1
	Adds a IsOrdered flag to Hashtable and HashMap, which allows iteration in insertion order
2021-06-09	AK: Allow changing the HashTable behaviour for sets on existing entries	Idan Horowitz
	Specifically, replacing the existing entry or just keeping it and canceling the set.
2021-05-30	AK: Make HashTable::operator=(HashTable&&) clear the moved-from table	Andreas Kling
	This is consistent with how other AK containers behave when moved from.
2021-05-15	AK+LibC: Implement malloc_good_size() and use it for Vector/HashTable	Gunnar Beutner
	This implements the macOS API malloc_good_size() which returns the true allocation size for a given requested allocation size. This allows us to make use of all the available memory in a malloc chunk. For example, for a malloc request of 35 bytes our malloc would internally use a chunk of size 64, however the remaining 29 bytes would be unused. Knowing the true allocation size allows us to request more usable memory that would otherwise be wasted and make that available for Vector, HashTable and potentially other callers in the future.
2021-04-22	Everything: Move to SPDX license identifiers in all files.	Brian Gianforcaro
	SPDX License Identifiers are a more compact / standardized way of representing file license information. See: https://spdx.dev/resources/use/#identifiers This was done with the `ambr` search and replace tool. ambr --no-parent-ignore --key-from-file --rep-from-file key.txt rep.txt *
2021-04-11	AK: Annotate HashTable functions as [[nodiscard]]	Brian Gianforcaro

2021-04-11	AK: Make HashTable with capacity constructor explicit	Brian Gianforcaro

2021-04-02	AK: Inline HashTable writing bucket lookup	thislooksfun
	The old approach was more complex and also had a very bad edge case with lots of collisions. This approach eliminates that possiblility. It also makes both reading and writing lookups a little bit faster.
2021-04-02	AK: Inline the bucket index calculation	thislooksfun
	The result of the modulo is only used in the array index, so why make the code more complex by calculating it in two different places?
2021-03-12	Everywhere: Remove klog(), dbg() and purge all LogStream usage :^)	Andreas Kling
	Good-bye LogStream. Long live AK::Format!
2021-02-23	Everywhere: Rename ASSERT => VERIFY	Andreas Kling
	(...and ASSERT_NOT_REACHED => VERIFY_NOT_REACHED) Since all of these checks are done in release builds as well, let's rename them to VERIFY to prevent confusion, as everyone is used to assertions being compiled out in release. We can introduce a new ASSERT macro that is specifically for debug checks, but I'm doing this wholesale conversion first since we've accumulated thousands of these already, and it's not immediately obvious which ones are suitable for ASSERT.
2021-01-31	HashTable: Correctly pass args to set	Lenny Maiorani
	Problem: - Using regular functions rather than function templates results in the arguments not being deduced. This then requires the same function to be written multiple times and for `move` to be used rather than `forward`. Solution: - Collapse multiple function overloads to a single function template with a deduced argument. This allows the argument to be a forwarding reference and bind to either an l-value or r-value and forward the value.
2021-01-12	AK: Simplify constructors and conversions from nullptr_t	Lenny Maiorani
	Problem: - Many constructors are defined as `{}` rather than using the ` = default` compiler-provided constructor. - Some types provide an implicit conversion operator from `nullptr_t` instead of requiring the caller to default construct. This violates the C++ Core Guidelines suggestion to declare single-argument constructors explicit (https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#c46-by-default-declare-single-argument-constructors-explicit). Solution: - Change default constructors to use the compiler-provided default constructor. - Remove implicit conversion operators from `nullptr_t` and change usage to enforce type consistency without conversion.
2020-10-18	AK: Reduce memory writes in HashTable destructor	Dano Perniš

2020-10-18	AK: Implement HashTable assignment in terms of swap	Dano Perniš

2020-10-18	AK: Provide swap() for HashTable	Dano Perniš