I rarely take pictures in black and white. I need to up my camera game. My old Canon Powershot Elph 300HS point and shoot has been showing its age for a while now. I can still take good pictures with it but it requires an increasing amount of effort, patience and adequate lighting. Things which are hard to come by when shooting wild like I do most of the time. Plus I do not like to do post-processing on my pictures so if I can get them in a decent quality straight out of the camera then that would save me a ton of time.
Post processing is all the rage now with the rise of instagram filters and snap chat. The less real a picture looks the more interesting it is to the human eye. You can watch a video of a trip to Paris and then you visit it and you realize that the colours do not pop as much as you thought they would. But alas a new camera with a bigger sensor will bring less noise and the added in body stabilization I will end up with fewer blurry pictures. Now I just have to find the perfect compromise between size, price and quality.
Yamfoot commented: My goodness Owen! You take the prize for the longest blog ever!
I was looking for something on my computer and found the text of a blog post I had done. And there was a link to this which you had written 15 years ago!!!!!!!!
Thanks for the memories. I always like to go back in the past and read things I wrote or things about me.
Keep blogging
http://www.owensoft.net/v4/user/91/ ... read 1 more
The main goal of this re-index was to be able to find the keywords "salary scale" on the mof website - mission accomplished!
As for the code I had to make distinctions between ignored words and ignored websites because the test to see if a link was "trash" was getting really slow so I had to split out the filter into 2 parts: for malformed links I want to skip and the other for websites that I do not want to crawl (but still want to list).
Stats; Links count: 29016. Words: 6594 => 10555. References to [jamaica] was 15029 now 59093. Index size was 14.5 mb is now 36.8mb. Almost a 50% file size increase and considering I am quickly running out of server space. I uploaded it and was promptly welcomed with this message;
"Fatal error: Allowed memory size of 268435456 bytes exhausted (tried to allocate 77 bytes)"
I had to reduce the file size so that it can fit in memory. Yes I load the entire file into memory for each search. So I looked at the index data array to see what I could possible throw away. I was collecting some meta tags from the html pages in the index so I threw those out and got the file down to 25mb. I thought of streaming the file into memory but that is currently now possible since it is serialized. If push comes to shove I may have to resort to a new file formate in the future.
I finally went down into the awfully bloated jis.gov.jm website but I had to block cirt.gov.jm because it is pure link spam. Everytime I look at this code I find a new bug or edgecase. I am not sure how modern programmers get things done on projects which have so many complications. Yeah I could use the a search API and apply some filters but where is the fun in that? I would be already done but what would I have learned? Nothing.