User Tools

Site Tools


clean_history

Simple way to remove duplicate entries from your history file

sorts by url, then uniques (ignoring timestamps) and sorts again. result: new history file where you only have the earliest visit of all url's you visited more then once.

sort -k3 history | uniq -f2 | sort > history.new

Sort by URL, sort by number of occurrences of each URL, take the 5000 most visited:

sort -k3 history | uniq -f2 -c | sort -n | sed 's/[0-9 ]* //' | tail -n 5000 > history.new
clean_history.txt · Last modified: 2016/08/31 14:10 (external edit)