[STL Unique] regarding the duplicated elements
Question on what happens to the extra elements that are found to be duplicates. Specifically, the last sentence in the following paragraph.
So it's dereferenceable but you get garbage for those elements?
Every time a consecutive group of duplicate elements appears in the range [first, last), the algorithm unique removes all but the first element. That is, unique returns an iterator new_last such that the range [first, new_last) contains no two consecutive elements that are duplicates.  The iterators in the range [new_last, last) are all still dereferenceable, but the elements that they point to are unspecified.
in SGI's example, they created new iterator.
vector<int>::iterator new_end = unique(V.begin(), V.end());
what if I reused the end() iterator?
V.end() = unique(V.begin(), V.end());
is this causing memory leak from the new end() to the old end()?
Let's say there are 10 total elements and 5 are duplicates. So according to the SGI description, the size() will still be 10 after unique. What's the preferred way to get the right size? do you always have to involve an extra erase command after unique?
Last edited by rssmps; 12-26-2005 at 05:40 AM.
unique rearranges the container, pushing duplicates past the new logical end, but not erasing them. it returns an iterator that marks the new logical end of the container, i.e., the first non-unique element (which is one position past the last unique element). so if you have the following sequence:
the rerranged container after calling unique is:
the new_end points to the fifth element (instead of the sixth, as in the origianl container. remmeber that end() always returns one past the last valid element)
The iterators aren't strictly invalidated because the container doesn't reallocate or shrink. However, the values to which the iterators point might change, so if for instance *it was 5, it could now be 2. The values of the dereferenced iterators are unspecified because you can't tell how the duplicates (i.e., the elements at new_end and higher) are ordered. So if you access such an iterator, your code will not crash but the value is unknown, or at least not portably known.
This implies that in order to remove the duplciates you have to call erase (new_end, end) and if you want to release the memory occupied by these elements you have to use the self-swapping idiom described here:
Last edited by Danny; 12-26-2005 at 04:08 PM.
OK, I get it.
The swap is a neat trick!
Last edited by rssmps; 12-27-2005 at 02:51 AM.
Last Post: 10-31-2005, 02:04 PM
By J.C. Tierney in forum Enterprise
Last Post: 01-14-2002, 07:32 AM
Last Post: 08-03-2001, 01:06 PM
Last Post: 04-30-2001, 02:27 AM
Last Post: 01-30-2001, 12:47 PM
Top DevX Stories
Easy Web Services with SQL Server 2005 HTTP Endpoints
JavaOne 2005: Java Platform Roadmap Focuses on Ease of Development, Sun Focuses on the "Free" in F.O.S.S.
Wed Yourself to UML with the Power of Associations
Microsoft to Add AJAX Capabilities to ASP.NET
IBM's Cloudscape Versus MySQL