data analysis Detecting bots in online voting Bots can ruin even the best competition. Their advantages are scale and speed, which make them very hard to compete with and can discourage genuine participants. Usually, as a competition organizer,
neural network How to install CUDA and TensorFlow GPU on Ubuntu 16.04 I remember trying, 4 years ago, to configure CUDA on a laptop with Ubuntu 14.04 and Nvidia Optimus technology. It was quite a tough process. Several times I messed up so badly that it was easier to reinstall the whole OS for a fresh start than to try to undo everything.
Ghost Free hosting for Ghost This post isn't related to the main blog topics. However, Red Hat recently closed the OpenShift v2 platform where my blog was hosted, and I had to migrate it to the new version
R profvis - code profiling in R Usually, when you run R code interactively, it is easy to spot which parts of the script are the most time-consuming. For functions or nested loops, however, it may not be so obvious.
R multidplyr - dplyr meets parallel processing Note: I assume that you are familiar with dplyr. If not, I suggest first going through the following tutorial: https://cran.rstudio.com/web/packages/dplyr/vignettes/introduction.html. Intro dplyr is one of
R R - Heat maps with ggplot2 Heat maps are a very useful graphical tool for understanding or presenting data stored in a matrix in a more accessible form. E.g. they are very helpful when looking for or comparing missing values in
R R - parallel computing in 5 minutes (with foreach and doParallel) Parallel computing is easy to use in R thanks to packages like doParallel. However, before we decide to parallelize our code, we should still remember that there is a trade-off between simplicity and
Python Kaggle Otto Group Product Classification Challenge Just finished the Otto competition on Kaggle, in which 3514 teams took part. Participants had to classify products into one of nine categories based on data provided by an e-commerce company and had 2
R R - devtools and RCurl If you see this sort of error during devtools (or any other R package) installation on Ubuntu: * installing *source* package ‘RCurl’ ... ** package ‘RCurl’ successfully unpacked and MD5 sums checked checking for curl-config... no
CUDA Ubuntu 14.04 - install OpenCV with CUDA Today I'll show you how to compile and install OpenCV with support for Nvidia CUDA technology, which will allow you to use the GPU to speed up image processing. I assume that you already
R Object serialization in R Saving and restoring objects in R is simple and can sometimes be very helpful, especially if you want to keep results from a very time-consuming analysis that you obviously don't want to
Torch Torch7 - Reading CSV into tensor Loading content from CSV files in Torch is not as easy as it should be (at least for a Lua beginner). I started with the csvigo module and wanted to load the data, first, into a table