Java-Collections

每次看都应该有不同的体会

概览

总体上来看集合应该分为两种 Collection 和 Map

Collection

列表
- ArrayList，可变长数组
- LinkedList，基于双向链表实现
- Vector，类似于 ArrayList，源码中使用了 synchronize，线程安全
队列
- Queue，使用 LinkedList 实现了 Queue 接口
- PriorityQueue，优先队列，使用堆来实现
集合
- HashSet，基于 HashMap 实现，查找复杂度为 O(n)
- TreeSet，基于 TreeMap 实现，键值有序，查找时间复杂度 O(log2(n))

Map

HashMap，用拉链法解决哈希冲突，非线程安全
TreeMap，用红黑树实现，非线程安全
HashTable，使用了 synchronize 关键字，线程安全，效率低，每次访问锁定整个表
ConcurrentHashMap，使用分段锁，线程安全，相对于 HashTable，效率高
LinkedHashMap，使用双向链表维护键的顺序

具体

ArrayList

常用成员函数

add(T t)
remover(T t)/remover(int index)

如何扩容

ArrayList 底层是使用数组实现的。这里 transient 关键字修饰 elementData 数组，主要是因为 elementData 并非总是满的，每次序列化时不应该直接将该数组序列化，因此加了 transient 关键字。

transient Object[] elementData; // non-private to simplify nested class access

初始大小为 10。

private static final int DEFAULT_CAPACITY = 10;

当需要扩容时

private void grow(int minCapacity) {
        // overflow-conscious code
        int oldCapacity = elementData.length;
        int newCapacity = oldCapacity + (oldCapacity >> 1);
        if (newCapacity - minCapacity < 0)
            newCapacity = minCapacity;
        if (newCapacity - MAX_ARRAY_SIZE > 0)
            newCapacity = hugeCapacity(minCapacity);
        // minCapacity is usually close to size, so this is a win:
        elementData = Arrays.copyOf(elementData, newCapacity);
    }

可以看出 ArrayList 每次扩容 ` int newCapacity = oldCapacity + (oldCapacity » 1);，扩容 1.5 倍。然后将原来维护的数组拷贝到新创建的空间中elementData = Arrays.copyOf(elementData, newCapacity);`。因此如果提前知道 ArrayList 的大小，可以在初始化 ArrayList 的时候指定。

删除元素

删除指定位置元素后，会将该位置之后的元素都前进一个位置。

 private void fastRemove(int index) {
        modCount++;
        int numMoved = size - index - 1;
        // 不是最后一个元素的时候，将被删除元素之后的元素挨个前移
        if (numMoved > 0)
            System.arraycopy(elementData, index+1, elementData, index,
                             numMoved);
        elementData[--size] = null; // clear to let GC do its work
    }

fast-fail 机制

ArrayList 类从 AbstractList 类中继承而来 protected transient int modCount = 0;，每次改变 ArrayList 数组中的元素时都递增 modCount 的值。主要是在 writeObject 函数中检查序列化前后 ArrayList 是否发生了变化。

private void writeObject(java.io.ObjectOutputStream s)
        throws java.io.IOException{
        // 缓存序列化之前的 modCount 值
        int expectedModCount = modCount;
        s.defaultWriteObject();

        // Write out size as capacity for behavioural compatibility with clone()
        s.writeInt(size);

        // Write out all elements in the proper order.
        for (int i=0; i<size; i++) {
            s.writeObject(elementData[i]);
        }
		// 序列化之后，查看 modCount 的值是否和序列化之前的相同
        // 如果不相同，说明在序列化期间，有别的线程对 ArrayList 进行了修改
        // 因此抛出异常
        if (modCount != expectedModCount) {
            throw new ConcurrentModificationException();
        }
    }

LinkedList

常用函数

add(T t)
remover(T t)/remover(int index)

Vector

常用函数

add(T t)
remover(T t)/remover(int index)

Queue

常用函数

add(T t)/offer(T t)，添加一个元素，其实 offer 函数内部也是用 add 实现的
poll()，返回第一个元素并删除，如果没有，返回 null
peek()，返回第一个元素，如果没有，返回 null

PriorityQueue

常用函数

add(T t)/offer(T t)，添加一个元素，其实 offer 函数内部也是用 add 实现的
poll()，返回第一个元素并删除，如果没有，返回 null
peek()，返回第一个元素，如果没有，返回 null

如何实现优先

堆

HashMap

常用成员函数

put(key, val)
get(key)
containsKey(key)

如何计算 hashCode

    static final int hash(Object key) {
        int h;
        return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
    }

上述代码表明 HashMap 的 key 是可以是 null 的，只要是 null 即返回 0。非零的情况下 key.hashCode() ^ (h>>>16)

添加元素

    final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
                   boolean evict) {
        Node<K,V>[] tab;
        Node<K,V> p;
        int n, i;
        if ((tab = table) == null || (n = tab.length) == 0) // 如果是初始状态，将创建 table
            n = (tab = resize()).length;
        if ((p = tab[i = (n - 1) & hash]) == null)  // 如果对应的键值为 null，那么直接将 tab[i] 置为 node
            tab[i] = newNode(hash, key, value, null);
        else {
            Node<K,V> e;
            K k;
            if (p.hash == hash &&
                ((k = p.key) == key || (key != null && key.equals(k))))
                e = p;
            else if (p instanceof TreeNode) // 如果是树的节点，将插入 (key,value)
                e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
            else {
                for (int binCount = 0; ; ++binCount) {
                    if ((e = p.next) == null) { // 遍历到最后一个元素
                        p.next = newNode(hash, key, value, null); // 使用尾插法
                        if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                            treeifyBin(tab, hash); // 如果链表超过了长度，那么将链表重构成红黑树
                        break;
                    }
                    if (e.hash == hash &&
                        ((k = e.key) == key || (key != null && key.equals(k)))) // 如果链表中是否有相同的 (key,value) 直接返回
                        break;
                    p = e;
                }
            }
            if (e != null) { // existing mapping for key
                V oldValue = e.value;
                if (!onlyIfAbsent || oldValue == null)
                    e.value = value;
                afterNodeAccess(e);
                return oldValue;
            }
        }
        ++modCount;
        if (++size > threshold)
            resize();
        afterNodeInsertion(evict);
        return null;
    }

从上面的代码可以得知：

hashMap 不允许存在相同的 (key, value) 组合存在
每个 hash 值对应的链表长度如果超过 TREEFY_THRESHOLD 将会把链表重构为红黑树
当还是链表时，插入新元素使用尾插法
最后两个参数的含义还未搞懂

"Java 集合类的基本使用和内部实现"

概览

Collection

Map

具体

ArrayList

常用成员函数

如何扩容

删除元素

fast-fail 机制

LinkedList

常用函数

Vector

常用函数

Queue

常用函数

PriorityQueue

常用函数

如何实现优先

HashMap

常用成员函数

如何计算 hashCode

添加元素

TreeMap

常用成员函数

key 如何有序

HashSet

常用函数

TreeSet

常用函数

ConcurrentHashMap

常用成员函数

分段锁如何实现

LinkedHashMap

常用成员函数

如何实现 LRU

FEATURED TAGS

FRIENDS