- Mar 10, 2015
-
-
Jason Evans authored
-
- Mar 07, 2015
-
-
Jason Evans authored
This regression was introduced by 97c04a93 (Use first-fit rather than first-best-fit run/chunk allocation.).
-
Jason Evans authored
This tends to more effectively pack active memory toward low addresses. However, additional tree searches are required in many cases, so whether this change stands the test of time will depend on real-world benchmarks.
-
Jason Evans authored
Treat sizes that round down to the same size class as size-equivalent in trees that are used to search for first best fit, so that there are only as many "firsts" as there are size classes. This comes closer to the ideal of first fit.
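As an illustration of the quantization, here is a minimal sketch of a szad-style comparator; the node layout and the size_class_index() helper are assumptions for this example, not jemalloc's actual code. Comparing by quantized size first and address second means every run in a given size class sorts together, and the leftmost match is the lowest-addressed fit within that class.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical tree node; real run/chunk nodes carry more state. */
typedef struct {
	void	*addr;
	size_t	size;
} node_t;

/* Toy quantization: floor(log2(size)). Real size classes are finer
 * grained; this only shows that equal-class sizes compare as equal. */
static size_t
size_class_index(size_t size)
{
	size_t lg = 0;
	while ((size >>= 1) != 0)
		lg++;
	return lg;
}

/* szad comparator: quantized size is the primary key, address the
 * secondary key, so there is exactly one "first" per size class. */
static int
node_szad_comp(const node_t *a, const node_t *b)
{
	size_t a_class = size_class_index(a->size);
	size_t b_class = size_class_index(b->size);

	if (a_class != b_class)
		return (a_class < b_class) ? -1 : 1;
	if (a->addr != b->addr)
		return ((uintptr_t)a->addr < (uintptr_t)b->addr) ? -1 : 1;
	return 0;
}
```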
-
Jason Evans authored
Recent changes have improved huge allocation scalability, which removes upward pressure to set the chunk size so large that huge allocations are rare. Smaller chunks are more likely to completely drain, so set the default to the smallest size that doesn't leave excessive unusable trailing space in chunk headers.
-
- Mar 04, 2015
-
-
Mike Hommey authored
TlsGetValue differs semantically from pthread_getspecific in that it can return a non-error NULL value, so it always sets the last error. But allocator callers may not expect a call to e.g. free() to change the last error value, so preserve it.
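A minimal sketch of the idea, assuming a hypothetical wrapper name; the only point is to save and restore the thread's last-error value around the TLS read.

```c
#include <windows.h>

/* TlsGetValue() calls SetLastError() even when it returns a
 * legitimate NULL, so preserve the caller-visible last error. */
static void *
tls_get_preserve_error(DWORD key)
{
	DWORD saved = GetLastError();
	void *val = TlsGetValue(key);
	SetLastError(saved);
	return val;
}
```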
-
Mike Hommey authored
9906660e added a --without-export configure option to avoid exporting jemalloc symbols, but the option didn't actually work.
-
- Feb 26, 2015
-
-
Dave Huseby authored
-
- Feb 19, 2015
-
-
Jason Evans authored
-
Jason Evans authored
These regressions were introduced by ee41ad40 (Integrate whole chunks into unused dirty page purging machinery.).
-
- Feb 18, 2015
-
-
Jason Evans authored
Rename "dirty chunks" to "cached chunks", in order to avoid overloading the term "dirty". Fix the regression caused by 339c2b23 (Fix chunk_unmap() to propagate dirty state.), and actually address what that change attempted, which is to only purge chunks once, and propagate whether zeroed pages resulted into chunk_record().
-
Jason Evans authored
Fix chunk_unmap() to propagate whether a chunk is dirty, and modify dirty chunk purging to record this information so it can be passed to chunk_unmap(). Since the broken version of chunk_unmap() claimed that all chunks were clean, this resulted in potential memory corruption for purging implementations that do not zero (e.g. MADV_FREE). This regression was introduced by ee41ad40 (Integrate whole chunks into unused dirty page purging machinery.).
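To see why the propagated flag matters, here is an illustrative purge helper using madvise() directly (jemalloc's real purge path is more involved, and the function name is an assumption): MADV_DONTNEED makes anonymous pages read back as zero on Linux, while MADV_FREE leaves the old contents in place, so the caller has to be told the memory is still dirty.

```c
#include <stdbool.h>
#include <stddef.h>
#include <sys/mman.h>

/* Purge a range and report whether its pages are guaranteed to read
 * back as zero afterwards. */
static bool
pages_purge_sketch(void *addr, size_t length)
{
#ifdef MADV_FREE
	/* Contents may survive until reclaim: still dirty, not zeroed. */
	madvise(addr, length, MADV_FREE);
	return false;
#else
	/* Anonymous pages are zero-filled on next access. */
	madvise(addr, length, MADV_DONTNEED);
	return true;
#endif
}
```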
-
Jason Evans authored
-
Jason Evans authored
-
Jason Evans authored
-
- Feb 17, 2015
-
-
Jason Evans authored
Extend per arena unused dirty page purging to manage unused dirty chunks in addition to unused dirty runs. Rather than immediately unmapping deallocated chunks (or purging them in the --disable-munmap case), store them in a separate set of trees, chunks_[sz]ad_dirty. Preferentially allocate dirty chunks. When excessive unused dirty pages accumulate, purge runs and chunks in integrated LRU order (and unmap chunks in the --enable-munmap case). Refactor extent_node_t to provide accessor functions.
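For the accessor-function refactoring mentioned above, a hedged illustration; the field and function names are assumptions, and the real extent_node_t also carries arena, profiling, and tree-linkage state.

```c
#include <stdbool.h>
#include <stddef.h>

/* Simplified stand-in for an extent node. */
typedef struct extent_node_s {
	void	*addr;   /* chunk address */
	size_t	size;    /* chunk size */
	bool	zeroed;  /* true if contents are known to be zero */
} extent_node_t;

/* Accessors keep callers out of the struct internals. */
static inline void *
extent_node_addr_get(const extent_node_t *node)
{
	return node->addr;
}

static inline void
extent_node_addr_set(extent_node_t *node, void *addr)
{
	node->addr = addr;
}

static inline bool
extent_node_zeroed_get(const extent_node_t *node)
{
	return node->zeroed;
}
```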
-
- Feb 16, 2015
-
-
Jason Evans authored
This regression was introduced by 88fef7ce (Refactor huge_*() calls into arena internals.), and went undetected because of the --enable-debug regression.
-
Jason Evans authored
This regression was introduced by 88fef7ce (Refactor huge_*() calls into arena internals.), and went undetected because of the --enable-debug regression.
-
Jason Evans authored
Fix --enable-debug to actually enable debug mode. This regression was introduced by cbf3a6d7 (Move centralized chunk management into arenas.).
-
Jason Evans authored
-
- Feb 15, 2015
-
-
Jason Evans authored
-
- Feb 14, 2015
-
-
Jason Evans authored
-
- Feb 13, 2015
-
-
Abhishek Kulkarni authored
Signed-off-by: Abhishek Kulkarni <adkulkar@umail.iu.edu>
-
Dan McGregor authored
Also allow for the possibility that a VERSION file exists in the srcroot, in case of an out-of-tree build from a release tarball.
-
Dan McGregor authored
-
Jason Evans authored
Although exceedingly unlikely, it appears that writes to the prof_tctx field of arena_chunk_map_misc_t could be reordered such that a stale value could be read during deallocation, with profiler metadata corruption and invalid pointer dereferences being the most likely effects.
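A hedged sketch of the hazard and the style of fix, using C11 atomics rather than jemalloc's internal wrappers (the structure and field names here are placeholders): the metadata pointer must be published with release semantics and read with acquire semantics, so a deallocating thread cannot observe a stale value.

```c
#include <stdatomic.h>

typedef struct prof_tctx_s prof_tctx_t;   /* opaque for this sketch */

typedef struct {
	_Atomic(prof_tctx_t *) prof_tctx;     /* placeholder field */
} map_misc_sketch_t;

/* Publish the profiling metadata only after it is fully initialized. */
static void
prof_tctx_publish(map_misc_sketch_t *m, prof_tctx_t *tctx)
{
	atomic_store_explicit(&m->prof_tctx, tctx, memory_order_release);
}

/* Read it during deallocation; acquire pairs with the release above. */
static prof_tctx_t *
prof_tctx_read(map_misc_sketch_t *m)
{
	return atomic_load_explicit(&m->prof_tctx, memory_order_acquire);
}
```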
-
- Feb 12, 2015
-
-
Jason Evans authored
Make redirects to the huge_*() API the arena code's responsibility, since arenas now take responsibility for all allocation sizes.
-
Daniel Micay authored
8ddc9329 switched this over to using the address tree in order to avoid false negatives, so the search now needs to check that the size of the free extent is large enough to satisfy the request.
-
Jason Evans authored
Migrate all centralized data structures related to huge allocations and recyclable chunks into arena_t, so that each arena can manage huge allocations and recyclable virtual memory completely independently of other arenas.

Add chunk node caching to arenas, in order to avoid contention on the base allocator.

Use chunks_rtree to look up huge allocations rather than a red-black tree.

Maintain a per arena unsorted list of huge allocations (which will be needed to enumerate huge allocations during arena reset).

Remove the --enable-ivsalloc option, make ivsalloc() always available, and use it for size queries if --enable-debug is enabled. The only practical implications of this removal are that 1) ivsalloc() is now always available during live debugging (and the underlying radix tree is available during core-based debugging), and 2) size query validation can no longer be enabled independent of --enable-debug.

Remove the stats.chunks.{current,total,high} mallctls, and replace their underlying statistics with simpler atomically updated counters used exclusively for gdump triggering. These statistics are no longer very useful because each arena manages chunks independently, and per arena statistics provide similar information.

Simplify chunk synchronization code, now that base chunk allocation cannot cause recursive lock acquisition.
-
Jason Evans authored
-
Jason Evans authored
Fix a serious regression in tcache_bin_flush_small() that was introduced by 1cb181ed (Implement explicit tcache support.).
-
- Feb 11, 2015
-
-
Jason Evans authored
-
- Feb 10, 2015
-
-
Jason Evans authored
-
Jason Evans authored
Add the MALLOCX_TCACHE() and MALLOCX_TCACHE_NONE macros, which can be used in conjunction with the *allocx() API. Add the tcache.create, tcache.flush, and tcache.destroy mallctls. This resolves #145.
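A minimal usage sketch of the new API (error handling trimmed): create a cache with the tcache.create mallctl, route allocations through it with MALLOCX_TCACHE(), then flush and destroy it.

```c
#include <stddef.h>
#include <jemalloc/jemalloc.h>

int
main(void)
{
	unsigned tc;
	size_t sz = sizeof(tc);

	/* Create an explicit thread cache; its id comes back via oldp. */
	if (mallctl("tcache.create", &tc, &sz, NULL, 0) != 0)
		return 1;

	/* Allocate and deallocate through that specific cache. */
	void *p = mallocx(4096, MALLOCX_TCACHE(tc));
	if (p != NULL)
		dallocx(p, MALLOCX_TCACHE(tc));

	/* Drop the cache's contents, then tear it down. */
	mallctl("tcache.flush", NULL, NULL, &tc, sizeof(tc));
	mallctl("tcache.destroy", NULL, NULL, &tc, sizeof(tc));
	return 0;
}
```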
-
Jason Evans authored
Fix arena_get() to refresh the cache as needed in the (!init_if_missing && refresh_if_missing) case. This flaw was introduced by the initial arena_get() implementation, which was part of 8bb3198f (Refactor/fix arenas manipulation.).
-
- Feb 05, 2015
-
-
Jason Evans authored
Recent huge allocation refactoring associates huge allocations with arenas, but it remains necessary to quickly look up huge allocation metadata during reallocation/deallocation. A global radix tree remains a good solution to this problem, but locking would have become the primary bottleneck after (upcoming) migration of chunk management from global to per arena data structures. This lock-free implementation uses double-checked reads to traverse the tree, so that in the steady state, each read or write requires only a single atomic operation. This implementation also assures that no more than two tree levels actually exist, through a combination of careful virtual memory allocation which makes large sparse nodes cheap, and skipping the root node on x64 (possible because the top 16 bits are all 0 in practice).
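A minimal sketch of a double-checked read on one tree level, assuming a fixed fanout and C11 atomics (names and layout are illustrative, not the real rtree code): the fast path is a single acquire load, and only the rare miss path pays for a CAS to publish a new child.

```c
#include <stdatomic.h>
#include <stdlib.h>

typedef struct rtree_node_s rtree_node_t;
struct rtree_node_s {
	_Atomic(rtree_node_t *) children[256];   /* illustrative fanout */
};

static rtree_node_t *
rtree_child_get(rtree_node_t *node, unsigned i)
{
	/* Fast path: one acquire load; non-NULL means the child was fully
	 * initialized before being published. */
	rtree_node_t *child = atomic_load_explicit(&node->children[i],
	    memory_order_acquire);
	if (child != NULL)
		return child;

	/* Slow path: allocate a child and try to publish it; lose any
	 * race gracefully. */
	rtree_node_t *fresh = calloc(1, sizeof(*fresh));
	if (fresh == NULL)
		return NULL;
	if (atomic_compare_exchange_strong_explicit(&node->children[i],
	    &child, fresh, memory_order_acq_rel, memory_order_acquire))
		return fresh;
	free(fresh);    /* another thread installed a child first */
	return child;   /* CAS failure loaded the winner into 'child' */
}
```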
-
Jason Evans authored
lg_floor(0) is undefined, but depending on compiler options may not cause a crash. This assertion makes it harder to accidentally abuse lg_floor().
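For context, a sketch of such a guard, assuming the common compiler-builtin implementation (GCC/Clang __builtin_clzll); the real lg_floor() has several platform-specific paths.

```c
#include <assert.h>
#include <stddef.h>

/* floor(log2(x)); meaningless for x == 0, so fail fast in debug builds
 * rather than relying on the builtin's undefined behavior. */
static unsigned
lg_floor_sketch(size_t x)
{
	assert(x != 0);
	return (unsigned)(63 - __builtin_clzll((unsigned long long)x));
}
```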
-
Jason Evans authored
Refactor base_alloc() to guarantee that allocations are carved from demand-zeroed virtual memory. This supports sparse data structures such as multi-page radix tree nodes. Enhance base_alloc() to keep track of fragments which were too small to support previous allocation requests, and try to consume them during subsequent requests. This becomes important when request sizes commonly approach or exceed the chunk size (as could radix tree node allocations).
-
Jason Evans authored
-
Jason Evans authored
- atomic_*_p().
- atomic_cas_*().
- atomic_write_*().
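Roughly what these families cover, shown with C11 equivalents rather than jemalloc's internal wrappers (which have their own naming and return conventions): atomic pointer operations, compare-and-swap, and unconditional atomic writes.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

static _Atomic uint64_t counter;
static _Atomic(void *) slot;

/* Compare-and-swap: store 'desired' only if counter still equals
 * 'expected'; returns whether the swap happened. */
static bool
cas_example(uint64_t expected, uint64_t desired)
{
	return atomic_compare_exchange_strong(&counter, &expected, desired);
}

/* Unconditional atomic writes to an integer and a pointer. */
static void
write_example(uint64_t v, void *p)
{
	atomic_store(&counter, v);
	atomic_store(&slot, p);
}
```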
-