<h2 id="peer-review">Peer Review</h2>
<p>This is my review of the work wuwei did during the summer, and of his blog posts. I found his work very interesting, and there are a few things worth talking about.</p>
<h2 id="main-ideas">Main ideas</h2>
<p>The Transformers API is nicely implemented. The old converter and <a href="http://www.shogun-toolbox.org/api/6.0.0/classshogun_1_1CPreprocessor.html">preprocessor</a> APIs were essentially doing the same thing, hence a unified class was needed. <code class="highlighter-rouge">.fit</code> and <code class="highlighter-rouge">.transform</code> are simple to use from SWIG too, as shown <a href="https://github.com/shogun-toolbox/shogun/blob/41888fe7c8dc3797063d674f452e13351f321338/examples/meta/src/converter/independent_component_analysis_fast.sg#L14">here</a>. I agree the ref-ing is sometimes a lot of trouble to deal with (for example <a href="https://github.com/shogun-toolbox/shogun/pull/4285/files#diff-90ffa0d34a7969080b14e84afd82eae7L58">here</a>), hence this work really shines in its simplicity in spite of being huge.</p>
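<p>For illustration, the unified API boils down to fit-then-transform. Below is a minimal C++ sketch of that usage; the choice of <code class="highlighter-rouge">CPruneVarSubMean</code> and the exact headers are my assumptions, not taken from the reviewed PRs:</p>
<pre><code class="language-c++">#include <shogun/base/init.h>
#include <shogun/base/some.h>
#include <shogun/features/DenseFeatures.h>
#include <shogun/preprocessor/PruneVarSubMean.h>

using namespace shogun;

int main()
{
    init_shogun_with_defaults();

    SGMatrix<float64_t> mat(2, 3);
    for (index_t i = 0; i < mat.num_rows * mat.num_cols; ++i)
        mat.matrix[i] = i; // column-major fill with dummy data

    auto features = some<CDenseFeatures<float64_t>>(mat);
    auto preproc = some<CPruneVarSubMean>();

    preproc->fit(features);                                 // learn mean/variance
    CFeatures* transformed = preproc->transform(features);  // apply them
    SG_UNREF(transformed);

    exit_shogun();
    return 0;
}
</code></pre>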
<p>The pipeline API can be used to chain transformers and machines together as shown <a href="https://github.com/vinx13/shogun/blob/56acfd9bb58ccd38b7f3ce8f177de042dfa056ca/examples/meta/src/pipeline/pipeline.sg#L24">here</a>. Because something like</p>
<pre><code class="language-Python">p.over("submean", transformer("PruneVarSubMean")).then("kmeans", machine("KMeans"))
</code></pre>
<p>is possible in Python, the API is quite similar to that of <a href="http://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.html">sklearn</a>. I really liked this. Also, the keywords <code class="highlighter-rouge">over</code> and <code class="highlighter-rouge">then</code> make it clear that the pipeline expects a pair of a transformer and a machine. Overall it is definitely user friendly.<br />
The pipeline is also easy to use with cross validation, as shown <a href="https://github.com/shogun-toolbox/shogun/blob/5bd958ca3accdd6e21da2fb10b06566c097addc5/examples/meta/src/evaluation/cross_validation_pipeline.sg#L25">here</a>.</p>
<p>The custom exceptions that were introduced will enable some creative error handling in shogun. The idea of separating <code class="highlighter-rouge">logical_errors</code> from <code class="highlighter-rouge">out_of_range</code> exceptions is informative and clean. The idea is simply to throw a std exception instead of always using <code class="highlighter-rouge">ShogunException</code>, with the help of two macros, <code class="highlighter-rouge">SG_THROW</code> and <code class="highlighter-rouge">REQUIRE_E</code>. I think we can write down using these exceptions all over shogun as future work somewhere, because this will make a nice task for contributors.</p>
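<p>To make the idea concrete, here is a minimal sketch of what such a macro can do; the macro name and body are hypothetical, not the actual <code class="highlighter-rouge">SG_THROW</code>/<code class="highlighter-rouge">REQUIRE_E</code> definitions:</p>
<pre><code class="language-c++">#include <sstream>
#include <stdexcept>

// Hypothetical macro in the spirit of SG_THROW: the caller picks the
// std exception type instead of always throwing ShogunException.
#define MY_THROW(ExceptionType, message)      \
    do                                        \
    {                                         \
        std::ostringstream ss;                \
        ss << message;                        \
        throw ExceptionType(ss.str());        \
    } while (0)

void check_index(int i, int n)
{
    // out_of_range for bad indices; std::logic_error would signal a broken invariant
    if (i < 0 || i >= n)
        MY_THROW(std::out_of_range, "index " << i << " not in [0, " << n << ")");
}
</code></pre>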
<p>The view template for features and labels is a nice step towards immutable features. Making a shallow copy of the features ensures the originals are not modified. The best thing is that they are <code class="highlighter-rouge">const</code> during a view, <a href="https://github.com/shogun-toolbox/shogun/pull/4352/files#diff-1c85d9d179a2ac86ac9808c1e1ea342eR25">here</a>. I also liked returning <code class="highlighter-rouge">Some</code> as subset features; no one likes dealing with refs/unrefs. The <code class="highlighter-rouge">duplicate</code> method <a href="https://github.com/shogun-toolbox/shogun/pull/4352/files#diff-d635d9223d8ee233fa41029992b2d832R137">here</a> does not incur data-copying overhead. It is also clearer how to use this, since <code class="highlighter-rouge">add_subset</code> followed by <code class="highlighter-rouge">remove_subset</code> can be complicated to work with.</p>
<p>Untemplated linalg has a lot of ideas I found very interesting. The idea behind lazy evaluation is to deal with types at runtime: an untemplated Vector and Matrix can be converted to templated instances of SGVector and SGMatrix using an operator. I like how the code for basic things like dot, add and multiply is already there, demonstrating the idea’s feasibility. I had some trouble understanding how <code class="highlighter-rouge">Exp</code> came into the picture <a href="http://wuwei.io/post/2018/06/lazy-evaluation-with-expression-templates-1/">here</a>; some hints about that would make it clearer. I found it a bit difficult to differentiate between implicit and explicit evaluations of expressions (<a href="https://github.com/vinx13/shogun-untemplated-demo/blob/162a9aceb28d231580f70e2cb1f8c7b45b073894/demo.cpp#L32">here</a> for example). <br />
This is where the lazy evaluation truly shines, in the sense that the expression is not evaluated until it is “needed”, i.e. assigned to a Vector. I think the more we eval implicitly the better; a few thoughts about that would be helpful. The recursive simplicity of the eval method is very intuitive: solve lhs, solve rhs, apply the operator. <a href="https://github.com/vinx13/shogun-untemplated-demo/blob/162a9aceb28d231580f70e2cb1f8c7b45b073894/demo.cpp#L39">An explicit eval here</a> is, however, a price we pay for being lazy, it seems.</p>
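<p>As a toy illustration of the pattern (my own minimal sketch, not the actual prototype): <code class="highlighter-rouge">operator+</code> only builds an expression node, and nothing is computed until the expression is assigned to a Vector.</p>
<pre><code class="language-c++">#include <cstddef>
#include <vector>

// An Add node only remembers its operands; eval() is the recursive
// "solve lhs, solve rhs, apply the operator" step.
template <typename L, typename R>
struct Add
{
    const L& lhs;
    const R& rhs;
    double eval(std::size_t i) const { return lhs.eval(i) + rhs.eval(i); }
    std::size_t size() const { return lhs.size(); }
};

struct Vector
{
    std::vector<double> data;
    explicit Vector(std::size_t n) : data(n) {}
    double eval(std::size_t i) const { return data[i]; }
    std::size_t size() const { return data.size(); }

    // Assigning an expression to a Vector is the implicit evaluation point.
    template <typename E>
    Vector& operator=(const E& expr)
    {
        for (std::size_t i = 0; i < data.size(); ++i)
            data[i] = expr.eval(i);
        return *this;
    }
};

template <typename L, typename R>
Add<L, R> operator+(const L& a, const R& b)
{
    return Add<L, R>{a, b};
}

int main()
{
    Vector a(3), b(3), c(3);
    a.data = {1, 2, 3};
    b.data = {4, 5, 6};
    c = a + b + a; // builds Add<Add<Vector, Vector>, Vector>, evaluated lazily here
    return 0;
}
</code></pre>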
<p>Maybe we can add a few words about future work and ideas to the <a href="http://wuwei.io/post/2018/08/gsoc18-final-review/">blog</a>. This will help pick things up later. Ideas like a meta example for the inverse transform API, using custom exceptions shogun-wide, or the systematic tests with all features and labels views would be a few good candidates.</p>
<h2 id="conclusion">Conclusion</h2>
<p>I have enjoyed working on shogun with you this summer. I appreciate the complexity of the work that has been done over the summer. Your work during the project has kept me excited and helped me work more efficiently. I look forward to writing more code together.</p>
<p><a href="http:/blog/gsoc'18/Peer-Review">Peer Review Continuous Detoxification</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on August 11, 2018.</p>http:/blog/gsoc'18/Final-Report2018-08-10T00:00:00+00:002018-08-10T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<h2 id="overview">Overview</h2>
<p><strong>Name</strong>: Shubham Shukla<br />
<strong>Project</strong>: Inside The Black Box<br />
<strong>Mentors</strong>: Heiko Strathmann, Giovanni De Toni<br />
<strong>Organization</strong>: Shogun Machine Learning Toolbox</p>
<h3 id="abstract">Abstract</h3>
<p>Shogun is a large scale machine learning toolkit shaped by many different minds and ideas. This means we have a lot of opportunities to optimize what goes on under the hood and create something simple and impactful. This project focuses a bit more on iterative algorithms, among other things. The premature stopping framework was improved to make it more robust and natural to use and modify; the progress bar was made more verbose and implemented in more iterative algorithms; and we worked on making algorithms respect the provided feature types, making them fully templated for more generic behaviour.</p>
<h3 id="table-of-contents">Table of Contents</h3>
<ul>
<li><a href="#stoppablesgobject-class-and-progress-bar">StoppableSGObject class and progress bar</a></li>
<li><a href="#iterative-machine">Iterative Machine</a></li>
<li><a href="#feature-type-dispatching-and-generic-nature">Feature type dispatching and generic nature</a></li>
<li><a href="#other-contributions-and-ideas">Other Contributions and Ideas</a></li>
</ul>
<h3 id="stoppablesgobject-class-and-progress-bar">StoppableSGObject class and progress bar</h3>
<p>In shogun we have a <code class="highlighter-rouge">SignalHandler</code> that, for some algorithms, gives the user some control over what happens in case of an event like premature cancellation (<code class="highlighter-rouge">CTRL+C</code>). We took all the premature stopping code and made it accessible to non-<code class="highlighter-rouge">CMachine</code> types as well by placing it in a new <code class="highlighter-rouge">CStoppableSGObject</code> class. My mentor also introduced the <code class="highlighter-rouge">m_callback</code> data member, which accepts a lambda function that serves as a way to fire a cancel-computation signal.</p>
<h5 id="relevant-prs">Relevant PRs:</h5>
<p>The class is implemented in <a href="https://github.com/shogun-toolbox/shogun/pull/4280">PR4280</a>. Other updates include</p>
<ul>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4286">#4286</a>: replacing cancel_computation calls with macro</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4287">#4287</a>: enable premature stopping in all machines</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4291">#4291</a>: use case for <code class="highlighter-rouge">CStoppableSGObject</code> class</li>
</ul>
<h5 id="progress-bar">Progress bar</h5>
<p>We added a default prefix of <code class="highlighter-rouge">class_name::method_name</code> to the progress bar. This slightly changes the usage, which is now done via the <code class="highlighter-rouge">SG_PROGRESS</code> macro instead of the <code class="highlighter-rouge">progress</code> method. We implemented it in most iterative algorithms, which also helped us prepare a list of iterative algorithms.<br />
<strong>PR</strong>: <a href="https://github.com/shogun-toolbox/shogun/pull/4305">#4305</a>: The new macro and its usage<br />
<strong>Future Work</strong>: Finding more use cases for it and using it to extend the list of iterative algorithms.</p>
<h3 id="iterative-machine">Iterative Machine</h3>
<p>Previously an algorithm could define what happens when a training process is cancelled or paused with the help of methods like <code class="highlighter-rouge">on_next</code>, <code class="highlighter-rouge">on_pause</code> etc. This is not flexible for a user behind an interface like Python. We use the concept of <em>mixins</em> for the first time in shogun to write a new <code class="highlighter-rouge">CIterativeMachine</code> class, which allows the user to cancel training at any time, execute some other code, and then resume training later if needed. The prematurely stopped model remains usable and in a consistent state. This is done by making sure the model updates its <code class="highlighter-rouge">state</code> in every iteration.</p>
<h5 id="relevant-prs-1">Relevant PRs:</h5>
<p>The mixin is implemented in <a href="https://github.com/shogun-toolbox/shogun/pull/4335">PR4335</a>. Related PRs are:</p>
<ul>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4320">#4320</a>: update the <code class="highlighter-rouge">state</code> of Perceptron in every iteration</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4335">PR4335</a>: this also includes the implementation of <code class="highlighter-rouge">CIterativeMachine</code> in <code class="highlighter-rouge">CPerceptron</code></li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4347">#4347</a>: <code class="highlighter-rouge">CIterativeMachine</code> in <code class="highlighter-rouge">CNewtonSVM</code> class.</li>
</ul>
<h5 id="future-work">Future Work:</h5>
<ul>
<li>Porting all algorithms from the <a href="https://github.com/shogun-toolbox/shogun/wiki/List-of-iterative-algorithms">List of Iterative Algorithms</a> wiki page to this code style</li>
<li>Systematic tests for all iterative machines that ensure proper state updates and consistency.</li>
</ul>
<h3 id="feature-type-dispatching-and-generic-nature">Feature type dispatching and generic nature</h3>
<p>There is an implicit assumption in most algorithms that the provided features will be 64-bit dense. To introduce more generic behaviour in an automated way we have written some new classes. These are all <em>mixins</em> that use the curiously recurring template pattern: an algorithm inherits from a class templated on the algorithm itself as well as on the original base class. The concept is new to shogun and opens up a lot more possibilities for similar ideas. The idea is to dispatch feature types from the base class and then have subclasses implement a templated version of <code class="highlighter-rouge">train_machine</code>.</p>
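<p>The general shape of these classes, as a stripped-down sketch with hypothetical names (not the actual shogun classes), looks like this:</p>
<pre><code class="language-c++">// Base class of the usual hierarchy.
struct MachineBase
{
    virtual ~MachineBase() = default;
};

// CRTP mixin: templated on both the concrete machine (Derived) and the class
// it should slot under (Base), so it can forward to Derived's templated
// method without needing virtual templates.
template <typename Derived, typename Base>
struct DenseDispatch : Base
{
    bool train_dispatch()
    {
        // The real code would switch on the runtime feature type here.
        return static_cast<Derived*>(this)->template train_templated<double>();
    }
};

struct MyMachine : DenseDispatch<MyMachine, MachineBase>
{
    template <typename T>
    bool train_templated()
    {
        return true; // train with T-typed features here
    }
};

int main()
{
    MyMachine m;
    return m.train_dispatch() ? 0 : 1;
}
</code></pre>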
<h5 id="relevant-prs-2">Relevant PRs:</h5>
<ul>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4373">#4373</a>: This implements two mixin + CRTP classes to dispatch dense and string feature types. We also implement dense feature type dispatching in <code class="highlighter-rouge">CLDA</code> and <code class="highlighter-rouge">CLeastAngleRegression</code>. There are also unit tests for training a machine with a list of feature types.</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4389">#4389</a>: a meta example for the dense dispatcher</li>
</ul>
<h5 id="future-work-1">Future Work:</h5>
<ul>
<li>Add more dispatchers with tests along with implementations.</li>
<li>An automated way to create a new dispatcher class, or maybe a workaround so that we don’t need to have as many dispatchers as the number of feature types.</li>
</ul>
<h3 id="other-contributions-and-ideas">Other Contributions and Ideas:</h3>
<h5 id="observer-and-put">Observer and put</h5>
<p>Shogun has an API to set values of member parameters that are registered by algorithms. This API can be used together with parameter observers to record summaries of changes in member variables. This can be done using a new <code class="highlighter-rouge">put_observe</code> method that informs the parameter observers about anything that is being updated. This comes with a design change: using <code class="highlighter-rouge">put</code> to change member data in code instead of using assignment. To quantify the overhead we incur compared to direct assignment, we have written a benchmark in <a href="https://github.com/shogun-toolbox/shogun/pull/4342">#4342</a><br />
<strong>Future Work:</strong></p>
<ul>
<li>Prototyping and benchmarking of <code class="highlighter-rouge">put_observe</code></li>
<li>Inferring the type using <code class="highlighter-rouge">Any</code> instead of doing so in <code class="highlighter-rouge">CObservedValue</code></li>
</ul>
<h5 id="systematic-tests-for-iterative-machine">Systematic tests for Iterative machine</h5>
<p>Writing separate tests in multiple classes that aim to do the same thing is redundant. We can use <code class="highlighter-rouge">TYPED_TESTS</code> instead to cover a number of classes at once. This still has a few issues but will be possible in the future. An idea of how to do it is <a href="https://github.com/shogun-toolbox/shogun/pull/4327">#4327</a> for serialization tests.<br />
<strong>Future Work:</strong></p>
<ul>
<li>Using <code class="highlighter-rouge">LibASTParser</code> to provide more information on base classes</li>
<li>Making a general dataset for proper testing</li>
</ul>
<p>An idea of what these tests should look like is in <code class="highlighter-rouge">Perceptron_unittest.cc</code>.</p>
<h5 id="newtonsvm-refactoring">NewtonSVM Refactoring</h5>
<p>The implementation of <code class="highlighter-rouge">CNewtonSVM</code> was outdated. We refactored it to use <code class="highlighter-rouge">SGVector</code>/<code class="highlighter-rouge">SGMatrix</code> for data storage and the <code class="highlighter-rouge">linalg</code> API for computations. We also ported it to the new IterativeMachine code style.<br />
<strong>PRs</strong>: We refactored most of the code in <a href="https://github.com/shogun-toolbox/shogun/pull/4347">#4347</a>. We added a new linalg method, with two implementations, to calculate the pseudo-inverse of matrices in <a href="https://github.com/shogun-toolbox/shogun/pull/4356">#4356</a></p>
<h5 id="meta-examples-and-cookbook-contributions">Meta examples and cookbook contributions</h5>
<p>These are some contributions to meta examples and cookbooks</p>
<ul>
<li><strong>Neural Network Factory</strong>: Adding the option to <code class="highlighter-rouge">auto_initialize</code> the neural network, along with a new <code class="highlighter-rouge">layer</code> factory to create new layers. The example from Python looks much more intuitive now.<br />
<strong>PR</strong>: <a href="https://github.com/shogun-toolbox/shogun/pull/4386">#4386</a> contains the factory example along with a cookbook for training a convolutional neural network in shogun on a dataset of MNIST images of the digits 0, 1 and 2. The corresponding dataset PR is <a href="https://github.com/shogun-toolbox/shogun-data/pull/165">#165</a></li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4346">#4346</a>: NewtonSVM meta example</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4340">#4340</a>: Diffusion meta cookbook and example</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4310">#4310</a>: porting <code class="highlighter-rouge">KRRNystrom</code> and <code class="highlighter-rouge">LeastAngleRegression</code> to new API</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4297">#4297</a>: porting <code class="highlighter-rouge">KernelRidgeRegression</code> meta example to new API</li>
<li><strong>Deleting CLabelsFactory</strong>: The <code class="highlighter-rouge">CLabelsFactory</code> performed static casts which were not needed anymore, so we deleted it and used <code class="highlighter-rouge">as</code> for conversions<br />
<strong>PRs</strong>: <a href="https://github.com/shogun-toolbox/shogun/pull/4281">#4281</a>, <a href="https://github.com/shogun-toolbox/shogun/pull/4277">#4277</a></li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4278">#4278</a>: A few meta examples on using distance machines from factory</li>
<li><a href="https://github.com/shogun-toolbox/shogun/pull/4236">#4236</a>: factory methods in lda meta example</li>
</ul>
<p><strong>Parallel computation of <code class="highlighter-rouge">sample</code> in CLogDetEstimator</strong>: <a href="https://github.com/shogun-toolbox/shogun/pull/4235">#4235</a> We used OpenMP to parallelize the code, along with refactoring it for more efficient memory usage</p>
<p><a href="http:/blog/gsoc'18/Final-Report">Final Report Inside the Black Box</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on August 10, 2018.</p>http:/blog/my%20experience/Gsoc-Experience2018-08-05T00:00:00+00:002018-08-05T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>We have had a terrific time working together to solve some really cool problems. I found my work to be relevant. This is a short note to my mentors and future students.</p>
<p>For a detailed description of our main ideas please check out the <a href="https://shubham808.github.iohttp:/blog/featured/"><em>Featured Posts</em></a> section of my blog. For an even closer look at the project timeline check out the <a href="https://shubham808.github.iohttp:/blog/categories/index.html#Weekly%20Updates"><em>Weekly Updates</em></a> category. There you will find the intermediate ideas and decisions that led us to the final versions, which will be helpful for future contributors.</p>
<p>When I first started working on shogun, I worked on small and some large issues. The community was very supportive and helped me with everything from the trivial to the complicated through detailed and regular feedback. This, when mixed with Summer of Code and some regularity, transformed into brainstorming sessions with my mentors. I had a lot of fun exploring those ideas and implementing them in the project. However, not all of those plans have been realized in code; I would love to continue working on them after Summer of Code.</p>
<p>The most important part of a successful project is good communication with mentors. I have enjoyed that part of this summer to the fullest. Every week we would have a meeting on Hangouts, sometimes with a document open on the side where we wrote about new ideas. This went on for hours. The next morning I couldn’t wait to merge it as soon as I had something that compiled. Of course, the merging always came after a few days, or maybe a week, with a lot more thought to make it shine. I have learned a lot from my mentors about the way I should think about problems.</p>
<p>I want to thank my mentors Heiko Strathmann and Giovanni De Toni for working with me and always patiently helping me, for understanding my ideas and for realizing them in the shogun codebase. I am very excited to see where we take the new developments from here.</p>
<h3 id="advice-for-new-contributors">Advice for new contributors</h3>
<p>The community is welcoming of new contributors. People are very happy to help out and guide you through issues. Do not be afraid to try out new ideas. An explorative approach to things is the best way to understand and then improve upon new ideas. Taking your time to learn about things, realizing that you are stuck and then clearly explaining why that is, will lead to some of the most constructive ideas. Shogun welcomes new contributors to take up new ideas and produce nice code. Being active will <em>always</em> lead to a solution.</p>
<p><a href="http:/blog/my%20experience/Gsoc-Experience">Google Summer of Code with Shogun</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on August 05, 2018.</p>http:/blog/gsoc'18/StoppableSGObject and progress bar2018-08-02T00:00:00+00:002018-08-02T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<h3 id="overview">Overview:</h3>
<p>The code for the premature stopping framework was written in <code class="highlighter-rouge">CMachine</code>. We take all of its components and put them in a separate object type called <code class="highlighter-rouge">CStoppableSGObject</code>. This makes life easier and removes duplicated code.</p>
<h3 id="motivation">Motivation:</h3>
<p>The stopping framework is useful in classes that do not inherit from <code class="highlighter-rouge">CMachine</code>, like <code class="highlighter-rouge">CMachineEvaluation</code>. This makes the idea a lot more scalable in terms of usage, since all any class needs to do in order to include the whole thing is inherit from <code class="highlighter-rouge">CStoppableSGObject</code>. It also makes introducing new features, like a callback member function, a lot easier.</p>
<h3 id="implementation-details-and-design-choice">Implementation details and Design choice:</h3>
<p>The <code class="highlighter-rouge">CStoppableSGObject</code> inherits from <code class="highlighter-rouge">CSGObject</code>. It has all the members that were introduced in the <a href="https://github.com/shogun-toolbox/shogun/wiki/premature-stopping">premature stopping framework</a> last year. My mentor added a new feature that enables us to register a lambda function as callback whenever a new iteration starts in a loop. This is done by invoking an <code class="highlighter-rouge">SG_BLOCK_COMP</code> in the callback when a condition returns True.</p>
<h5 id="data-members-and-methods">Data members and Methods:</h5>
<p>Apart from the already present components of the premature stopping framework, we introduced a new way to cancel the computation of a machine. This will make testing easier and more understandable.</p>
<ul>
<li><code class="highlighter-rouge">m_callback</code>: It is a <code class="highlighter-rouge">std::function<bool></code> which can call <code class="highlighter-rouge">cancel_computation()</code> along with generating block signal from the <code class="highlighter-rouge">global_signal_handler</code>.
An example of callback is:</li>
</ul>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++"><span class="n">function</span><span class="o"><</span><span class="kt">bool</span><span class="p">()</span><span class="o">></span> <span class="n">callback</span> <span class="o">=</span> <span class="p">[</span><span class="k">this</span><span class="p">]()</span>
<span class="p">{</span>
<span class="c1">// Stop if we did more than 5 steps
</span> <span class="k">if</span> <span class="p">(</span><span class="n">m_last_iteration</span> <span class="o">>=</span> <span class="mi">5</span><span class="p">)</span>
<span class="p">{</span>
<span class="n">get_global_signal</span><span class="p">()</span><span class="o">-></span><span class="n">get_subscriber</span><span class="p">()</span><span class="o">-></span><span class="n">on_next</span><span class="p">(</span><span class="n">SG_BLOCK_COMP</span><span class="p">);</span>
<span class="k">return</span> <span class="nb">true</span><span class="p">;</span>
<span class="p">}</span>
<span class="n">m_last_iteration</span><span class="o">++</span><span class="p">;</span>
<span class="k">return</span> <span class="nb">false</span><span class="p">;</span>
<span class="p">};</span></code></pre></figure>
<ul>
<li><code class="highlighter-rouge">set_callback</code>: setter for <code class="highlighter-rouge">m_callback</code>.</li>
</ul>
<h3 id="some-thoughts">Some Thoughts:</h3>
<p>For <code class="highlighter-rouge">CIterativeMachine</code> this provides a base for a testing mechanism. The idea is to stop a model using the callback and compare it with reference results to test consistency. This enables us to simulate a user pressing <code class="highlighter-rouge">CTRL+C</code>, which will help us write good unit tests along with providing an alternative way to cancel computation.
For non-iterative classes this means the <code class="highlighter-rouge">COMPUTATIONS_CONTROLLERS</code> macro is still usable to support the signal handler and deal with signals in a systematic way without having to write all the code again.</p>
<h1 id="progress-bar-macro">Progress bar macro</h1>
<h3 id="overview-and-motivation">Overview and motivation:</h3>
<p>The progress bar now has an informative prefix by default. This is more verbose and makes it easier to understand and differentiate. This was done by adding a new <code class="highlighter-rouge">SG_PROGRESS</code> macro that prepends the <code class="highlighter-rouge">class name::method name</code> prefix to the progress bar. Since it is only possible to get the name of the function currently being executed, a macro is needed to obtain the name of the caller.</p>
<h3 id="examples-and-applying-to-more-algorithms">Examples and applying to more algorithms:</h3>
<p>The new, smooth progress bar looks like:</p>
<blockquote>
<table>
<tbody>
<tr>
<td>CustomKernel::get_kernel_matrix</td>
<td>██████████████████████████████████████████████████████</td>
<td>100.00% 0.0 seconds</td>
</tr>
</tbody>
</table>
</blockquote>
<blockquote>
<table>
<tbody>
<tr>
<td>KMeansMiniBatch::minibatch_KMeans</td>
<td>████████████████████████████████████████████████████</td>
<td>100.00% 0.0 seconds</td>
</tr>
</tbody>
</table>
</blockquote>
<p>To use it in a new algorithm we can make the following changes:</p>
<ul>
<li>Identify a candidate loop that will need the progress bar.</li>
<li>Use the macro at the beginning of the loop.</li>
</ul>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++"><span class="k">for</span> <span class="p">(</span><span class="k">auto</span> <span class="n">e</span> <span class="o">:</span> <span class="n">SG_PROGRESS</span><span class="p">(</span><span class="n">range</span><span class="p">(</span><span class="n">epochs</span><span class="p">)))</span></code></pre></figure>
<p>All <code class="highlighter-rouge">CIterativeMachine</code> subclasses will automatically have a progress bar, since we apply it in <code class="highlighter-rouge">continue_train()</code>.</p>
<h3 id="future-work">Future Work:</h3>
<ul>
<li>Use the progress bar anywhere it seems feasible.</li>
<li>Expand the <code class="highlighter-rouge">List of Iterative Algorithms</code> while searching for suitable candidates.</li>
</ul>
<p><a href="http:/blog/gsoc'18/StoppableSGObject-and-progress-bar">StoppableSGObject class and progress bar</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on August 02, 2018.</p>http:/blog/gsoc'18/Some more ideas around shogun2018-08-02T00:00:00+00:002018-08-02T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<h3 id="overview">Overview:</h3>
<p>This is a collection of some spin-off stories we worked on during the project. They are some nice future ideas and other cool stuff we played with and would like to develop into something better.</p>
<h3 id="iterative-machine-automated-tests">Iterative Machine automated tests:</h3>
<p>A simple and elegant way to test iterative machines is by using three reference models. We train the first model for, say, 5 iterations and the second model for 10 iterations; the third model is to be trained for 10 iterations but is prematurely stopped at 5. The prematurely stopped model must produce the same result as the first reference model. We then call continue_train so that the model completes training at 10 iterations. The result must be the same as that of the second reference model.
To automate these tests, we will create a <code class="highlighter-rouge">generator</code> Python file which will collect all <code class="highlighter-rouge">CIterativeMachine</code> subclasses and create various <code class="highlighter-rouge">type lists</code>. These can be used to write <code class="highlighter-rouge">TYPED_TESTS</code> which implement the above idea.</p>
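<p>A rough sketch of what such a generated test could look like with googletest; the stand-in machines are hypothetical, and the real type list would come from the generator:</p>
<pre><code class="language-c++">#include <gtest/gtest.h>

// Hypothetical stand-ins for generated CIterativeMachine subclasses. Real
// tests would prematurely stop training via a callback and then call
// continue_train; these fakes just echo the iteration count.
struct FakePerceptron { int train(int iters) { return iters; } };
struct FakeNewtonSVM  { int train(int iters) { return iters; } };

template <typename T>
class IterativeMachineTest : public ::testing::Test
{
};

// The generator script would emit this type list.
typedef ::testing::Types<FakePerceptron, FakeNewtonSVM> MachineTypes;
TYPED_TEST_CASE(IterativeMachineTest, MachineTypes);

TYPED_TEST(IterativeMachineTest, stop_and_continue_matches_references)
{
    TypeParam ref5, ref10, stopped;
    // stopped at 5 iterations: must match the 5-iteration reference
    EXPECT_EQ(stopped.train(5), ref5.train(5));
    // after continuing to 10 iterations: must match the 10-iteration reference
    EXPECT_EQ(stopped.train(10), ref10.train(10));
}
</code></pre>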
<h5 id="future-work">Future work:</h5>
<ul>
<li>Using an AST parser or something similar to generate graphs that provide more information about inheritance</li>
<li>Choosing dataset such that the tests can remain <em>general</em> for all machines that need it.</li>
</ul>
<p>One test following this approach is available in <code class="highlighter-rouge">Perceptron_unittest.cc</code></p>
<h3 id="put-and-observe">Put and Observe:</h3>
<p>The Parameter Observer framework is a way to <em>observe</em> how various members change during training. To implement this in an algorithm we would need to call the <code class="highlighter-rouge">observe</code> method periodically. The new idea was to use the <code class="highlighter-rouge">put</code> API to update member variable state, and to observe whenever some variable is <code class="highlighter-rouge">put</code>. Basically, we will try to add a new <code class="highlighter-rouge">put_observe</code> API that uses the original API and also calls <code class="highlighter-rouge">observe</code> to update the parameter observers. From an algorithm’s view, this means we will not update member variables directly (using <code class="highlighter-rouge">=</code>); instead we will replace those updates with a call to <code class="highlighter-rouge">put_observe</code>. We wrote a benchmark to quantify the overhead that using put generates, with and without the observer.</p>
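<p>A minimal stand-in for the intended call pattern is below; the body is my sketch of the idea, not shogun’s real <code class="highlighter-rouge">put()</code>/observer machinery:</p>
<pre><code class="language-c++">#include <functional>
#include <iostream>
#include <string>

class Algorithm
{
public:
    // stand-in for whatever the ParameterObserver framework attaches
    std::function<void(const std::string&, double)> observer;

    template <typename T>
    void put_observe(const std::string& name, const T& value)
    {
        put(name, value); // the normal put() path updates the parameter
        if (observer)
            observer(name, value); // observers see every update
    }

private:
    template <typename T>
    void put(const std::string& name, const T& value)
    {
        // stand-in: the real put() writes to the registered member
        std::cout << name << " <- " << value << "\n";
    }
};

int main()
{
    Algorithm a;
    a.observer = [](const std::string& name, double value) {
        std::cout << "observed " << name << " = " << value << "\n";
    };
    a.put_observe("bias", 0.5); // replaces `m_bias = 0.5;` in algorithm code
    return 0;
}
</code></pre>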
<h5 id="future-work-1">Future work:</h5>
<ul>
<li>Since the <code class="highlighter-rouge">Any</code> API is now better, we can use it in the <code class="highlighter-rouge">ObservedValue</code> class for some runtime magic and type deduction.</li>
<li>Prototyping <code class="highlighter-rouge">put_observe</code>.</li>
</ul>
<h3 id="neural-networks">Neural Networks:</h3>
<p>We added a new dataset based on MNIST images of the digits <code class="highlighter-rouge">0, 1, 2</code>, along with a factory for neural networks. The model trains nicely; however, using the factory does not allow us to <code class="highlighter-rouge">connect</code> neural layers in a custom manner.</p>
<h5 id="future-work-2">Future work:</h5>
<ul>
<li>A workaround for custom connection of layers. This workaround will also be very helpful in porting more meta examples like <code class="highlighter-rouge">featureblock_logistic_regression</code> etc.</li>
<li>Remove <code class="highlighter-rouge">NeuralNets.i</code> from swig.</li>
</ul>
<h3 id="newtonsvm-refactoring">NewtonSVM Refactoring:</h3>
<p>This was a big spin-off. <code class="highlighter-rouge">CNewtonSVM</code> was implemented in an obsolete manner and we wrote it all over again, making it new and shinier. This included using <code class="highlighter-rouge">linalg</code> for most operations, removing the use of raw pointers in favour of <code class="highlighter-rouge">SGVector</code> and <code class="highlighter-rouge">SGMatrix</code>, and also implementing our <code class="highlighter-rouge">CIterativeMachine</code> code style here. The class is now a lot more readable, along with being a classic example of how <code class="highlighter-rouge">CIterativeMachine</code> works.
We also added the pseudo-inverse of matrices (separate self-adjoint and general implementations) to complete the refactor.</p>
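<p>For reference, the general variant boils down to an SVD where only the non-negligible singular values are inverted. Below is a sketch with Eigen (which shogun’s linalg wraps); this is not the actual <code class="highlighter-rouge">linalg::pinv</code> signature:</p>
<pre><code class="language-c++">#include <Eigen/Dense>

// Moore-Penrose pseudo-inverse via thin SVD: invert the non-negligible
// singular values and recompose. The self-adjoint variant can instead use
// an eigendecomposition, which is cheaper for symmetric matrices.
Eigen::MatrixXd pinv(const Eigen::MatrixXd& A, double tol = 1e-10)
{
    Eigen::JacobiSVD<Eigen::MatrixXd> svd(
        A, Eigen::ComputeThinU | Eigen::ComputeThinV);
    Eigen::VectorXd s = svd.singularValues();
    for (Eigen::Index i = 0; i < s.size(); ++i)
        s(i) = s(i) > tol ? 1.0 / s(i) : 0.0;
    return svd.matrixV() * s.asDiagonal() * svd.matrixU().transpose();
}
</code></pre>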
<h5 id="future-works">Future works:</h5>
<ul>
<li>Enabling the use of <code class="highlighter-rouge">svd_bdc</code> in <code class="highlighter-rouge">linalg::pinv</code> for faster singular value decomposition, and providing an API to get <em>all</em> decomposed matrices.</li>
<li>More efficient memory allocation by turning more temporaries into data members.</li>
<li>Refactoring more such classes</li>
</ul>
<p><a href="http:/blog/gsoc'18/Some-more-ideas-around-shogun">Some more ideas around shogun</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on August 02, 2018.</p>http:/blog/gsoc'18/IterativeMachine2018-08-02T00:00:00+00:002018-08-02T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<h3 id="overview">Overview:</h3>
<p>Iterative Machine enables us to write iterative algorithms that are prematurely stoppable. This means users can cancel the training process at any time. The model is still usable and in a consistent state. This model can then be applied to test data, compared with reference weights and, if needed, can <em>resume</em> training from where it left off earlier. The Iterative Machine framework makes using prematurely stopped models more robust.
The idea here is to have iterative models implement only a single iteration of the main training loop; this is now called from a while loop in the <a href="https://github.com/shogun-toolbox/shogun/tree/develop/src/shogun/machine/IterativeMachine.h#62">CIterativeMachine</a> class.</p>
<h3 id="motivation">Motivation:</h3>
<p>The previous idea was to have different callbacks like <code class="highlighter-rouge">on_next</code>, <code class="highlighter-rouge">on_pause</code> which are called based on the user choice obtained from the <code class="highlighter-rouge">ShogunSignalHandler</code> prompt. This proves a bit restrictive with respect to what the user can actually do. Furthermore, it was not possible to define such behaviour from an interface like Python; the user <em>has</em> to write some C++ code. Therefore, it is a better approach to allow the user to just cancel training whenever they want and then also to resume training from where it was left off. This is more flexible for the user and the developer, as it removes the element of <em>guessing</em> what is meaningful for a user.</p>
<h3 id="implementation-details">Implementation details:</h3>
<p>The <code class="highlighter-rouge">CIterativeMachine</code> is a mixin class. This means it can inherit from some other class which is passed to it through a template argument to its constructors. Iterative models will now inherit from <code class="highlighter-rouge">CIterativeMachine<CMockMachine></code> instead of being a direct subclass of <code class="highlighter-rouge">CMockMachine</code>.</p>
<h5 id="data-members">data members:</h5>
<ul>
<li><code class="highlighter-rouge">m_current_iteration</code>: The current iteration count.</li>
<li><code class="highlighter-rouge">m_max_iteration</code>: Maximum number of iterations allowed.</li>
<li><code class="highlighter-rouge">m_complete</code>: If the model has completed training and converged.
<h5 id="methods">methods:</h5>
</li>
<li><code class="highlighter-rouge">init_model</code>: Virtual, must be written in subclass, this is called before training loop begins to initialize all members.</li>
<li><code class="highlighter-rouge">continue_training</code>: Contains the main training loop which updates <code class="highlighter-rouge">m_current_iteration</code> and called the <code class="highlighter-rouge">iteration</code> method.</li>
<li><code class="highlighter-rouge">iteration</code>: Virtual, This must be written in subclass and implements a single iteration of training loop.</li>
<li><code class="highlighter-rouge">end_training</code>: An optional method called after training to clean member states or giving warnings etc.</li>
</ul>
<h3 id="example">Example:</h3>
<p>Below is a C++ example of a fake iterative model.</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++"><span class="cp">#include <shogun/base/init.h>
#include <shogun/base/some.h>
#include <shogun/labels/BinaryLabels.h>
#include <shogun/machine/IterativeMachine.h>
#include <iostream>
</span>
<span class="k">using</span> <span class="k">namespace</span> <span class="n">shogun</span><span class="p">;</span>
<span class="k">using</span> <span class="k">namespace</span> <span class="n">std</span><span class="p">;</span>
<span class="c1">// Mock Iterative Algorithm which implements fake methods
</span><span class="k">class</span> <span class="nc">MockModel</span> <span class="o">:</span> <span class="k">public</span> <span class="n">CIterativeMachine</span><span class="o"><</span><span class="n">CMachine</span><span class="o">></span>
<span class="p">{</span>
<span class="k">public</span><span class="o">:</span>
<span class="n">MockModel</span><span class="p">()</span> <span class="o">:</span> <span class="n">CIterativeMachine</span><span class="o"><</span><span class="n">CMachine</span><span class="o">></span><span class="p">()</span> <span class="p">{}</span>
<span class="o">~</span><span class="n">MockModel</span><span class="p">()</span> <span class="p">{}</span>
<span class="k">protected</span><span class="o">:</span>
<span class="k">virtual</span> <span class="kt">void</span> <span class="n">init_model</span><span class="p">(</span><span class="n">CFeatures</span> <span class="o">*</span> <span class="n">data</span><span class="p">)</span>
<span class="p">{</span>
<span class="c1">// Initialize members
</span> <span class="n">x</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span>
<span class="n">m_max_iterations</span> <span class="o">=</span> <span class="mi">10</span><span class="p">;</span>
<span class="p">}</span>
<span class="k">virtual</span> <span class="kt">void</span> <span class="n">iteration</span><span class="p">()</span>
<span class="p">{</span>
<span class="c1">// Single iteration of training loop
</span> <span class="n">x</span> <span class="o">=</span> <span class="n">x</span> <span class="o">+</span> <span class="n">m_labels</span><span class="o">-></span><span class="n">get_value</span><span class="p">(</span><span class="mi">0</span><span class="p">);</span>
<span class="p">}</span>
<span class="k">virtual</span> <span class="kt">void</span> <span class="n">end_training</span><span class="p">()</span>
<span class="p">{</span>
<span class="c1">// clean member variable states or give warnings and information
</span> <span class="n">cout</span><span class="o"><<</span><span class="n">x</span><span class="o"><<</span><span class="n">endl</span><span class="p">;</span>
<span class="n">x</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span>
<span class="p">}</span>
<span class="k">protected</span><span class="o">:</span>
<span class="n">float64_t</span> <span class="n">x</span><span class="p">;</span>
<span class="p">};</span>
<span class="kt">int</span> <span class="nf">main</span><span class="p">()</span>
<span class="p">{</span>
<span class="n">init_shogun_with_defaults</span><span class="p">();</span>
<span class="c1">// Set up binary labels
</span> <span class="k">auto</span> <span class="n">labels</span> <span class="o">=</span> <span class="n">some</span><span class="o"><</span><span class="n">CBinaryLabels</span><span class="o">></span><span class="p">(</span><span class="n">SGVector</span><span class="o"><</span><span class="n">float64_t</span><span class="o">></span><span class="p">({</span><span class="mi">1</span><span class="p">,</span> <span class="o">-</span><span class="mi">1</span><span class="p">}));</span>
<span class="n">MockModel</span> <span class="n">a</span><span class="p">;</span>
<span class="n">a</span><span class="p">.</span><span class="n">set_labels</span><span class="p">(</span><span class="n">labels</span><span class="p">);</span>
<span class="n">cout</span><span class="o"><<</span><span class="s">"Training Start..."</span><span class="o"><<</span><span class="n">endl</span><span class="p">;</span>
<span class="n">a</span><span class="p">.</span><span class="n">train</span><span class="p">();</span>
<span class="c1">// Press CTRL+C before training is complete. Another way to stop training
</span> <span class="c1">//is to pass a callback that will trigger can trigger a signal.
</span>
<span class="c1">// Here you can use the pre-trained model. For example we can apply on test data, serialize the model etc.
</span> <span class="n">cout</span><span class="o"><<</span><span class="s">"Resuming Training"</span><span class="o"><<</span><span class="n">endl</span><span class="p">;</span>
<span class="n">a</span><span class="p">.</span><span class="n">continue_train</span><span class="p">();</span>
<span class="k">return</span> <span class="mi">0</span><span class="p">;</span>
<span class="p">}</span></code></pre></figure>
<p>There are two ways to prematurely stop an algorithm: the user can press <code class="highlighter-rouge">CTRL+C</code>, or the user can write a callback that will trigger a signal. For more details on the second method see <a href="https://github.com/shogun-toolbox/shogun/pull/4293">this patch</a>. From Python the code will look like:</p>
<figure class="highlight"><pre><code class="language-python" data-lang="python"><span class="kn">from</span> <span class="nn">shogun</span> <span class="kn">import</span> <span class="n">Perceptron</span>
<span class="n">Perceptron</span><span class="o">.</span><span class="n">train</span><span class="p">(</span><span class="n">feats</span><span class="p">)</span>
<span class="c"># Press CTRL+C and you will see something like</span>
<span class="c"># [ShogunSignalHandler] Immediately return to prompt / Prematurely finish computations / Pause current computation / Do nothing (I/C/P/D)?</span>
<span class="c"># Type "C"</span>
<span class="c"># Perform operations like apply on test data, save current model etc</span>
<span class="n">Perceptron</span><span class="o">.</span><span class="n">continue_train</span><span class="p">()</span></code></pre></figure>
<h3 id="applying-iterative-machine-to-more-algorithms">Applying Iterative Machine to more Algorithms:</h3>
<p>To use the features of <code class="highlighter-rouge">CIterativeMachine</code> with a new Algorithm we can make the following changes:</p>
<ul>
<li>Use existing machine members (e.g. <code class="highlighter-rouge">m_w</code> and <code class="highlighter-rouge">bias</code> of <code class="highlighter-rouge">CLinearMachine</code> for weights and bias) instead of local copies. If there are corresponding local members present they must be removed. This makes sure the model updates its state every iteration.</li>
<li>Identify the main training loop. This is where the magic is happening.</li>
<li>Everything above the loop in the training process is a likely candidate for the <code class="highlighter-rouge">init_model</code> method.</li>
<li>The contents of the loop represent a single iteration, hence they go into <code class="highlighter-rouge">iteration</code>.</li>
</ul>
<p>And you are all set.</p>
<h3 id="future-work">Future Work:</h3>
<ul>
<li>Porting all iterative models to this code style is the aim here. A list of Iterative Algorithms is available <a href="https://github.com/shogun-toolbox/shogun/wiki/List-of-iterative-algorithms">here</a>.</li>
<li>Automated tests for all iterative machines. These will include tests for the correctness of a prematurely stopped model, along with a test to make sure that the model updates its state each iteration.</li>
</ul>
<p><a href="http:/blog/gsoc'18/IterativeMachine">Iterative Machine Guide</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on August 02, 2018.</p>http:/blog/gsoc'18/Feature type dispatching2018-08-02T00:00:00+00:002018-08-02T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<h3 id="overview">Overview:</h3>
<p>Most algorithms in shogun do not behave in a generic manner, in the sense that they are <em>type dependent</em>. The <code class="highlighter-rouge">train</code> method can accept any type of features as a <code class="highlighter-rouge">CFeatures*</code> pointer; however, it is later assumed that the features provided are of a particular type. We introduced feature dispatching to enable more generic behaviour in an automated way. Some algorithms (like <code class="highlighter-rouge">CLDA</code>) already try to take care of types and implement a templated <code class="highlighter-rouge">train_machine</code>. We take that idea and give it its own space in shogun <a href="https://github.com/shogun-toolbox/shogun/blob/develop/src/shogun/machine/FeatureDispatchCRTP.h">here</a>.</p>
<h3 id="motivation">Motivation:</h3>
<p>Least Angle Regression is an iterative algorithm that tries to stay type independent. This is a good idea in cases where a small feature matrix can be scaled down to the <code class="highlighter-rouge">float32_t</code> type. Implementing our Iterative Machine code style here was also a problem, since it meant having to perform type dispatching in <em>every iteration</em>. This is obviously redundant; even if it is cheap, it is not good code style. The idea here is to dispatch the feature type in the base class (<code class="highlighter-rouge">CMachine</code>) so that when we start the training loop, types are already taken care of.</p>
<p>An idea to solve such a problem could be using a hierarchy and then making a child class aware of the templated types, with other subclasses overloading virtual methods. This idea will not work because we cannot have virtual methods that are templated: once the run-time system figured out it would need to call a templated virtual function, compilation is already done and the compiler cannot generate the appropriate instance anymore.<br />
Hence, a mixin is a better idea here.</p>
<h3 id="implementation-details-and-design-choice">Implementation details and Design choice:</h3>
<p>The <code class="highlighter-rouge">CDenseRealDispatch</code> is a class to dispatch dense feature types in <code class="highlighter-rouge">FeatureDispatchCRTP.h</code>. It is a mixin class that takes two template arguments. First is all the members of the base class hence we definitely need to inherit that. The second is <em>something</em> to bring the templated version of <code class="highlighter-rouge">train_machine</code> in scope <em>up</em> the inheritance ladder. Hence we inherit it from the subclass itself. This is possible due to the concept of Curiously Recursive Template Pattern(<code class="highlighter-rouge">CRTP</code>). <code class="highlighter-rouge">C++</code> is lazy, this means a pointer for a class is available to use even <em>before</em> it is declared. In other words, a call to a member method of such a class does not need to be instantiated until the function is actually <em>called</em>. This is diffrent from a normal mixin approach that uses a single template argument because not all the methods can be collected with the help of a single class.</p>
<p>Classes (like <code class="highlighter-rouge">CLDA</code>, <code class="highlighter-rouge">CLeastAngleRegression</code>) which support dynamic dispatching via the mixin will inherit from <code class="highlighter-rouge">CDenseRealDispatch<CMockModel, CBaseMachine></code> instead of directly inheriting from <code class="highlighter-rouge">CBaseMachine</code>.</p>
<h5 id="methods">Methods:</h5>
<ul>
<li><code class="highlighter-rouge">train_dense</code>: Virtual, the method is written in <code class="highlighter-rouge">CDenseRealDispatch</code> called if the feature class of data pointer is <code class="highlighter-rouge">C_DENSE</code>. In the dispatcher this calls <code class="highlighter-rouge">train_machine_templated</code> of model with appropriate type.</li>
</ul>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++"><span class="k">virtual</span> <span class="kt">bool</span> <span class="nf">train_dense</span><span class="p">(</span><span class="n">CFeatures</span><span class="o">*</span> <span class="n">data</span><span class="p">)</span>
<span class="p">{</span>
<span class="k">auto</span> <span class="n">this_casted</span> <span class="o">=</span> <span class="k">this</span><span class="o">-></span><span class="k">template</span> <span class="n">as</span><span class="o"><</span><span class="n">P</span><span class="o">></span><span class="p">();</span>
<span class="k">switch</span> <span class="p">(</span><span class="n">data</span><span class="o">-></span><span class="n">get_feature_type</span><span class="p">())</span>
<span class="p">{</span>
<span class="k">case</span> <span class="n">F_DREAL</span><span class="p">:</span>
<span class="k">return</span> <span class="n">this_casted</span><span class="o">-></span><span class="k">template</span> <span class="n">train_machine_templated</span><span class="o"><</span><span class="n">float64_t</span><span class="o">></span><span class="p">(</span>
<span class="n">data</span><span class="o">-></span><span class="n">as</span><span class="o"><</span><span class="n">CDenseFeatures</span><span class="o"><</span><span class="n">float64_t</span><span class="o">>></span><span class="p">());</span>
<span class="k">case</span> <span class="n">F_SHORTREAL</span><span class="p">:</span>
<span class="k">return</span> <span class="n">this_casted</span><span class="o">-></span><span class="k">template</span> <span class="n">train_machine_templated</span><span class="o"><</span><span class="n">float32_t</span><span class="o">></span><span class="p">(</span>
<span class="n">data</span><span class="o">-></span><span class="n">as</span><span class="o"><</span><span class="n">CDenseFeatures</span><span class="o"><</span><span class="n">float32_t</span><span class="o">>></span><span class="p">());</span>
<span class="k">case</span> <span class="n">F_LONGREAL</span><span class="p">:</span>
<span class="k">return</span> <span class="n">this_casted</span>
<span class="o">-></span><span class="k">template</span> <span class="n">train_machine_templated</span><span class="o"><</span><span class="n">floatmax_t</span><span class="o">></span><span class="p">(</span>
<span class="n">data</span><span class="o">-></span><span class="n">as</span><span class="o"><</span><span class="n">CDenseFeatures</span><span class="o"><</span><span class="n">floatmax_t</span><span class="o">>></span><span class="p">());</span>
<span class="nl">default:</span>
<span class="n">SG_SERROR</span><span class="p">(</span>
<span class="s">"Training with %s of provided type %s is not "</span>
<span class="s">"possible!"</span><span class="p">,</span>
<span class="n">data</span><span class="o">-></span><span class="n">get_name</span><span class="p">(),</span>
<span class="n">feature_type</span><span class="p">(</span><span class="n">data</span><span class="o">-></span><span class="n">get_feature_type</span><span class="p">()).</span><span class="n">c_str</span><span class="p">());</span>
<span class="p">}</span>
<span class="k">return</span> <span class="nb">false</span><span class="p">;</span>
<span class="p">}</span></code></pre></figure>
<ul>
<li><code class="highlighter-rouge">train_string</code>: Virtual, this is similar to <code class="highlighter-rouge">train_dense</code> but it dispatches string types like <code class="highlighter-rouge">uint8_t</code>, <code class="highlighter-rouge">char</code>.</li>
<li><code class="highlighter-rouge">train_machine_templated</code>: This is a templated version of <code class="highlighter-rouge">train_machine</code> written in subclass. It is called with appropriate parameter by the dispatcher.</li>
</ul>
<p>These methods keep the feature class check in the base class and perform feature type checks in the mixin. This keeps dense and string features separate.</p>
<p>There is also an added detail: any class that implements feature type dispatching needs to pass features when calling <code class="highlighter-rouge">train()</code>. This is something we want to enforce all over shogun, and the mixin seemed a good place to start.</p>
<h3 id="example-and-tests">Example and Tests:</h3>
<p>A cookbook on how to use a class that supports dispatching is <a href="">here</a>.
The tests for dynamic dispatch use a fake model that returns <em>true</em> when a particular feature type is passed. The expected feature type is provided in the constructor.</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++"><span class="k">class</span> <span class="nc">CDenseRealMockMachine</span>
<span class="o">:</span> <span class="k">public</span> <span class="n">CDenseRealDispatch</span><span class="o"><</span><span class="n">CDenseRealMockMachine</span><span class="p">,</span> <span class="n">CMachine</span><span class="o">></span>
<span class="p">{</span>
<span class="k">public</span><span class="o">:</span>
<span class="n">CDenseRealMockMachine</span><span class="p">(</span><span class="n">EFeatureType</span> <span class="n">f</span><span class="p">)</span>
<span class="o">:</span> <span class="n">CDenseRealDispatch</span><span class="o"><</span><span class="n">CDenseRealMockMachine</span><span class="p">,</span> <span class="n">CMachine</span><span class="o">></span><span class="p">()</span>
<span class="p">{</span>
<span class="n">m_expected_feature_type</span> <span class="o">=</span> <span class="n">f</span><span class="p">;</span>
<span class="p">}</span>
<span class="o">~</span><span class="n">CDenseRealMockMachine</span><span class="p">()</span>
<span class="p">{</span>
<span class="p">}</span>
<span class="k">template</span> <span class="o"><</span><span class="k">typename</span> <span class="n">T</span><span class="o">></span>
<span class="kt">bool</span> <span class="n">train_machine_templated</span><span class="p">(</span><span class="n">CDenseFeatures</span><span class="o"><</span><span class="n">T</span><span class="o">>*</span> <span class="n">data</span><span class="p">)</span>
<span class="p">{</span>
<span class="k">if</span> <span class="p">(</span><span class="n">data</span><span class="o">-></span><span class="n">get_feature_type</span><span class="p">()</span> <span class="o">==</span> <span class="n">m_expected_feature_type</span><span class="p">)</span>
<span class="k">return</span> <span class="nb">true</span><span class="p">;</span>
<span class="k">return</span> <span class="nb">false</span><span class="p">;</span>
<span class="p">}</span>
<span class="k">virtual</span> <span class="k">const</span> <span class="kt">char</span><span class="o">*</span> <span class="n">get_name</span><span class="p">()</span> <span class="k">const</span>
<span class="p">{</span>
<span class="k">return</span> <span class="s">"CDenseRealMockMachine"</span><span class="p">;</span>
<span class="p">}</span>
<span class="n">EFeatureType</span> <span class="n">m_expected_feature_type</span><span class="p">;</span>
<span class="p">};</span></code></pre></figure>
<p>This is then tested with a few feature types for each dispatcher.</p>
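<p>Usage in a test then looks roughly like this (a sketch continuing the mock above; the actual unit test in the PR may differ):</p>
<pre><code class="language-c++">// The mock expects F_DREAL, so training with float64 dense features passes.
auto machine = some<CDenseRealMockMachine>(F_DREAL);
auto features = some<CDenseFeatures<float64_t>>(SGMatrix<float64_t>(2, 2));
EXPECT_TRUE(machine->train(features));
</code></pre>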
<h3 id="applying-dispatchers-to-more-classes">Applying Dispatchers to more Classes:</h3>
<p>To implement dense dispatching in more algorithms we can make the following changes:</p>
<ul>
<li>Port <code class="highlighter-rouge">train_machine</code> to its templated version <code class="highlighter-rouge">train_machine_templated</code>. This can be a bit tricky: it involves making the implementation fully templated and making sure the dispatched types are respected.</li>
<li>Inherit from the mixin instead of directly inheriting from base class.
<blockquote>
<ul>
<li>class CMockModel : public CMockMachine</li>
<li>class CMockModel : public CDenseRealDispatch<CMockModel, CMockMachine></li>
</ul>
</blockquote>
</li>
<li>Declare the dispatcher as a friend of the class. This is done so that the protected <code class="highlighter-rouge">train_machine_templated</code> is accessible from the dispatcher’s scope.</li>
</ul>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++"><span class="k">friend</span> <span class="k">class</span> <span class="nc">CMockModel</span> <span class="o">:</span> <span class="k">public</span> <span class="n">CDenseRealDispatch</span><span class="o"><</span><span class="n">CMockModel</span><span class="p">,</span> <span class="n">CMockMachine</span><span class="o">></span></code></pre></figure>
<p>The idea is the same for <code class="highlighter-rouge">CStringFeaturesDispatch</code>.
With these changes in place you have a fully templated model.</p>
<p>Writing a dispatcher for a new feature class F can be done by:</p>
<ul>
<li>Identify which feature types make sense to dispatch. This depends heavily on the feature class we pick.</li>
<li>Add a new method, <code class="highlighter-rouge">train_F</code> or another suitable name, to CMachine and update <code class="highlighter-rouge">CMachine::train()</code> to dispatch on the new feature class.</li>
<li>Add a new dispatcher class to <code class="highlighter-rouge">FeatureDispatcherCRTP.h</code> in the same way as the existing ones (see the sketch after this list).</li>
<li>Implement it in an algorithm or write a unit test.</li>
</ul>
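<p>To make the shape concrete, here is a small self-contained sketch of the whole flow for a hypothetical feature class F. Every name below (FFeatures, FDispatch, train_F) is a stand-in, not shogun’s actual API; the real dispatchers live in <code class="highlighter-rouge">FeatureDispatcherCRTP.h</code>:</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">#include &lt;cstdio&gt;

enum EFeatureType { F_SHORTREAL, F_DREAL };

struct Features
{
    virtual ~Features() = default;
    virtual EFeatureType get_feature_type() const = 0;
};

template &lt;typename T&gt;
struct FFeatures : Features
{
    EFeatureType get_feature_type() const override
    {
        return sizeof(T) == 8 ? F_DREAL : F_SHORTREAL;
    }
};

struct Machine
{
    virtual ~Machine() = default;
    // CMachine::train() would branch here on the feature class and call
    // the matching train_F hook
    bool train(Features* data) { return train_F(data); }
    virtual bool train_F(Features*) { return false; }
};

// the CRTP mixin: P is the subclass itself, M the base class it extends
template &lt;typename P, typename M&gt;
struct FDispatch : M
{
    bool train_F(Features* data) override
    {
        auto* self = static_cast&lt;P*&gt;(this);
        switch (data-&gt;get_feature_type())
        {
        case F_DREAL:
            return self-&gt;template train_machine_templated&lt;double&gt;(
                static_cast&lt;FFeatures&lt;double&gt;*&gt;(data));
        case F_SHORTREAL:
            return self-&gt;template train_machine_templated&lt;float&gt;(
                static_cast&lt;FFeatures&lt;float&gt;*&gt;(data));
        }
        return false;
    }
};

// the model only implements the templated training, no duplicated dispatch
struct MockModel : FDispatch&lt;MockModel, Machine&gt;
{
    template &lt;typename T&gt;
    bool train_machine_templated(FFeatures&lt;T&gt;*)
    {
        std::printf("dispatched on a %zu-byte type\n", sizeof(T));
        return true;
    }
};

int main()
{
    FFeatures&lt;double&gt; feats;
    MockModel model;
    return model.train(&amp;feats) ? 0 : 1;
}</code></pre></figure>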
<h3 id="future-work">Future Work:</h3>
<ul>
<li>Add more dispatchers with tests, and roll the dispatchers out all over shogun.</li>
<li>A nice design improvement would be an automated way to create a new dispatcher class, or a workaround so that we don’t need as many dispatchers as we have feature types.</li>
</ul>
<p><a href="http:/blog/gsoc'18/Feature-type-dispatching">Feature Dispatching Guide</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on August 02, 2018.</p>http:/blog/weekly%20updates/Post-122018-07-23T00:00:00+00:002018-07-23T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we completed the mixin and merged it into develop.</p>
<p>We also made <code class="highlighter-rouge">train_machine_templated</code> protected again by making the mixin base class a friend of its subclasses.<br />
Also, <code class="highlighter-rouge">CWDSVMOcas</code> is not the best candidate for string features since it actually asserts a particular feature type, so we had to drop the new changes there. <br />
We added a unit test for the dispatcher. The strategy is to make a fake machine that implements a templated train_machine. The model returns true if the feature type received from the <code class="highlighter-rouge">train</code> call matches the expected feature type, which is set in the machine constructor. <br />
It all worked out pretty well. <br />
We saw a few problems, like the fact that we will need a new mixin class for each feature class we dispatch on. There are a lot of feature classes, so this does feel a bit redundant, although it also makes sense to keep different feature classes separate. We will think about this a bit more. <br />
Another problem is when train_machine_templated is called with an <code class="highlighter-rouge">illegal</code> type parameter. Such an error is not caught and causes compile problems downstream. A solution is to add another type parameter to train_machine_templated that defaults to allowing only certain types, like floating-point types. When train_machine_templated is called with a type that is not allowed, we can throw a ShogunException and avoid messy compiler errors (see the sketch below).<br />
Thinking about this a bit more, we realized that we need to separate arithmetic types from floating-point types. This means a new mixin class for arithmetic types, and the problem might just scale upwards as we introduce more feature dispatchers.</p>
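<p>A minimal sketch of the restricted-type idea (illustrative only; <code class="highlighter-rouge">std::invalid_argument</code> stands in for ShogunException and the real solution may look different):</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">#include &lt;stdexcept&gt;
#include &lt;type_traits&gt;

// the enabled overload accepts floating-point types only; the fallback
// catches every other instantiation and throws at runtime, so the user
// gets one clear error instead of messy compiler output from deep
// inside the training code
template &lt;typename T&gt;
typename std::enable_if&lt;std::is_floating_point&lt;T&gt;::value, bool&gt;::type
train_machine_templated(const T* /*features*/)
{
    return true; // the real templated training would run here
}

template &lt;typename T&gt;
typename std::enable_if&lt;!std::is_floating_point&lt;T&gt;::value, bool&gt;::type
train_machine_templated(const T* /*features*/)
{
    throw std::invalid_argument("feature type not supported for training");
}

int main()
{
    double d = 1.0;
    train_machine_templated(&amp;d); // dispatches fine
    try
    {
        int i = 1;
        train_machine_templated(&amp;i); // compiles, but throws at runtime
    }
    catch (const std::invalid_argument&amp;)
    {
        return 0;
    }
    return 1;
}</code></pre></figure>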
<p>I also worked on a patch for the cookbook on convolutional neural networks. This was a fun patch. First I created a dataset from images of 0, 1, 2 by reading them with opencv and writing them down as matrices. I used some default parameters along with creating two factories, <code class="highlighter-rouge">neural_networks</code> and <code class="highlighter-rouge">neural_layer</code>. Initially the network did not behave nicely because of misleading parameters. I will work on this next week.</p>
<h3 id="contributions">Contributions</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4373">Feature type dispatching through recursive mixin</a><br />
<a href="https://github.com/shogun-toolbox/shogun/pull/4386">Neural Layers Cookbook</a></p>
<p><a href="http:/blog/weekly%20updates/Post-12">Week 10</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on July 23, 2018.</p>http:/blog/weekly%20updates/Post-112018-07-16T00:00:00+00:002018-07-16T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we came up with a second idea for feature dispatching. The earlier approach was to use macros to generate function names for the train_dense and train_string calls. This is not very automated, and macros are hard to debug since it can be difficult to extract useful information from them. We realized the problem is solvable via mixins. The idea is to have each machine implement a templated version of <code class="highlighter-rouge">train_machine</code>.</p>
<p>This will be called by the mixin depending on the feature type. However, since it is not possible to have virtual methods that are templated, we found a workaround for our case. We write the mixins as <code class="highlighter-rouge">CRTP</code> classes. CRTP, the Curiously Recurring Template Pattern, is a piece of C++ magic that relies on the compiler’s lazy instantiation of template member function calls. For our case this means we can pass the class name as a parameter to its own base class! As long as a method of the subclass is not called, the compiler need not give an error over this. We can now call the templated train_machine from the base class without making it virtual. The only issue I see is that train_machine_templated needs to be public now.</p>
<p>We will have a different dispatcher for each feature class. The subclasses inherit from the dispatcher, which accepts two template arguments: one is the subclass itself (this is the magic of CRTP), the second is a base class like LinearMachine. We implemented this for dense features and used the dispatcher in <code class="highlighter-rouge">CLeastAngleRegression</code> and <code class="highlighter-rouge">CLDA</code>.</p>
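<p>The trick in a nutshell, as a toy sketch independent of shogun:</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">// the base class receives the derived class as a template argument, so it
// can call a templated (hence non-virtual) method on it directly
template &lt;typename Derived&gt;
struct TrainDispatch
{
    bool train()
    {
        // only type-checked when instantiated, by which time Derived is
        // a complete type -- this is the "compiler laziness" above
        return static_cast&lt;Derived*&gt;(this)
            -&gt;template train_templated&lt;double&gt;();
    }
};

struct ToyMachine : TrainDispatch&lt;ToyMachine&gt;
{
    template &lt;typename T&gt;
    bool train_templated() { return sizeof(T) == 8; }
};

int main()
{
    ToyMachine m;
    return m.train() ? 0 : 1;
}</code></pre></figure>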
<p>The macro version of the solution had the merit that the feature class dispatching was done in <code class="highlighter-rouge">CMachine</code>. We combined this idea with the mixin:
we added a method train_dense which is called if dense features are provided and the machine supports dispatching. The idea remains the same for string features.
train_dense is implemented in our <code class="highlighter-rouge">CDenseRealDispatch</code> mixin. It checks the feature type and calls <code class="highlighter-rouge">train_machine_templated</code> with the appropriate template argument.
We also implemented string feature dispatching in <code class="highlighter-rouge">CWDSVMOcas</code>.</p>
<p>We also wrote a <code class="highlighter-rouge">feature_name</code> method which can generate a std::string for various feature classes.</p>
<h3 id="contributions">Contributions</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4373">Feature type dispatching through recursive mixin</a></p>
<p><a href="http:/blog/weekly%20updates/Post-11">Week 9</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on July 16, 2018.</p>http:/blog/weekly%20updates/Post-102018-07-09T00:00:00+00:002018-07-09T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we made some major refactors to the NewtonSVM class.</p>
<p>These include:</p>
<ul>
<li>Cleaning up all raw pointers and using SGVector and SGMatrix instead.</li>
<li>Using linalg instead of hand-written SGVector/SGMatrix ops.</li>
<li>Making NewtonSVM iterative.</li>
<li>Calculating bias and weights separately; until now both were kept in a single matrix.</li>
<li>Using the weights member of LinearMachine instead of a local member. This ensures the model is usable while paused.</li>
</ul>
<p>Next we worked on implementing the pseudo-inverse in linalg.
Any m x n matrix A can be decomposed as A = U S V<sup>T</sup>. If A is a self-adjoint positive semi-definite matrix, then a self-adjoint eigensolver can be used to calculate S and U (in this case A = U S U<sup>T</sup>), and the pseudo-inverse is A<sup>+</sup> = U S<sup>+</sup> U<sup>T</sup>, where S<sup>+</sup> inverts the non-zero entries of S. We already have a symmetric eigensolver in linalg, so we used it here.
For a general m x n matrix we use the singular value decomposition of A and calculate the pseudo-inverse as A<sup>+</sup> = V S<sup>+</sup> U<sup>T</sup>, where S<sup>+</sup> transposes S and inverts its non-zero singular values. This needed to be implemented directly on top of the Eigen backend (see the sketch below).</p>
<p>With this, all of the refactoring of NewtonSVM was completed.
We also worked on a systematic way to test all iterative machines: since they inherit from IterativeMachine, we can use ctags to sort them out and apply our test to them.</p>
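<p>For the general case, a sketch in plain Eigen (not the linalg wrapper itself; the tolerance value is an arbitrary choice):</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">#include &lt;Eigen/Dense&gt;
#include &lt;iostream&gt;

// pseudo-inverse of a general m x n matrix via SVD: A+ = V * S+ * U^T,
// where S+ inverts the singular values above a tolerance and zeroes the rest
Eigen::MatrixXd pinv(const Eigen::MatrixXd&amp; A, double tol = 1e-10)
{
    Eigen::JacobiSVD&lt;Eigen::MatrixXd&gt; svd(
        A, Eigen::ComputeThinU | Eigen::ComputeThinV);
    Eigen::VectorXd s = svd.singularValues();
    for (int i = 0; i &lt; s.size(); ++i)
        s(i) = (s(i) &gt; tol) ? 1.0 / s(i) : 0.0;
    return svd.matrixV() * s.asDiagonal() * svd.matrixU().transpose();
}

int main()
{
    Eigen::MatrixXd A(3, 2);
    A &lt;&lt; 1, 2, 3, 4, 5, 6;
    std::cout &lt;&lt; pinv(A) &lt;&lt; "\n";
}</code></pre></figure>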
<h3 id="contributions">Contributions</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4335">NewtonSVM</a><br />
<a href="https://github.com/shogun-toolbox/shogun/pull/4335">pinv in linalg</a><br />
<a href="https://github.com/shogun-toolbox/shogun/pull/4335">Iterative machine test</a><br /></p>
<p><a href="http:/blog/weekly%20updates/Post-10">Week 8-End of Phase 2</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on July 09, 2018.</p>http:/blog/weekly%20updates/Post-92018-07-02T00:00:00+00:002018-07-02T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we worked on a few more cleanups for our mixin along with finally merging it into develop.<br />
We also wrote a cookbook for converters along with adding a factory for them.<br /></p>
<p>We decided to apply our mixin to another iterative model as an additional example.
For this we chose NewtonSVM, a LinearMachine. We updated the class a bit and discovered a bug
in IterativeMachine where end_training() was not called in case of premature stopping; that was fixed in this PR.<br />
<br />
The code of NewtonSVM is old, with a lot of raw pointers for memory allocation instead of SGVector and SGMatrix, along with
many places that should use linalg instead of for loops.<br />
<br />
As a start we applied IterativeMachine to it as is. This requires making everything that is shared across iterations
a data member of the subclass and initializing it to avoid memory leaks,
then implementing the init_model, iteration and end_training methods.<br />
<br />
We also worked on the benchmark for “put” this week. To cover various test cases we wrote a std::function member that can be given a
lambda implementing each flavour of update (see the sketch below). The benchmark shows that there is negligible overhead when updating members with
put instead of plain assignment. This will be used together with ParameterObserver in subsequent weeks.
<br />
<br />
Another example for IterativeMachine is Least Angle Regression. To port it to the IterativeMachine framework we will need to make iteration templated.
Currently most classes in shogun do not deal with feature dispatching at all, and it is done in a redundant manner in the classes that do, like LDA and LARS.
We tried a few things and will be using mixins to solve this problem.</p>
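<p>The rough shape of that benchmark (a hypothetical Google Benchmark sketch, not the actual PR code):</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">#include &lt;benchmark/benchmark.h&gt;
#include &lt;functional&gt;

static double weight = 0.0;

// each case supplies a lambda performing one flavour of update, so a
// single driver covers assignment, put(), and whatever else we add later
static void run_update_case(
    benchmark::State&amp; state, std::function&lt;void()&gt; update)
{
    for (auto _ : state)
        update();
}

BENCHMARK_CAPTURE(run_update_case, plain_assignment, [] { weight = 1.0; });
BENCHMARK_CAPTURE(run_update_case, via_put, [] {
    // machine-&gt;put("weight", 1.0) would go here
    weight = 1.0;
});
BENCHMARK_MAIN();</code></pre></figure>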
<h3 id="contributions">Contributions</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4335">NewtonSVM</a><br />
<a href="https://github.com/shogun-toolbox/shogun/pull/4335">Iterative Machine</a><br />
<a href="https://github.com/shogun-toolbox/shogun/pull/4335">Benchmark for put</a><br />
<a href="https://github.com/shogun-toolbox/shogun/pull/4335">Coverter factory</a><br /></p>
<p><a href="http:/blog/weekly%20updates/Post-9">Week 7</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on July 02, 2018.</p>http:/blog/weekly%20updates/Post-82018-06-25T00:00:00+00:002018-06-25T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we refined and completed the IterativeMachine class as a mixin.</p>
<p>We focused on Perceptron as an example to see how things would actually look
when we implement our idea.<br />
The flow begins when the user calls “train” on an iterative machine, which is Perceptron for us.
The train method is implemented in CMachine and calls train_machine with the provided data pointer.<br />
The mixin has three data members: m_current_iteration, m_max_iteration and m_complete.<br />
Normally train_machine is implemented in the subclasses and contains the training process of an algorithm.
With IterativeMachine, the mixin implements train_machine for its subclasses,
and the subclasses instead implement three methods:</p>
<ul>
<li>init_model: the subclass initializes its members here, along with any other ops that need to happen before training begins.</li>
<li>iteration: the actual iteration is implemented here.</li>
<li>end_training: an optional method for additional error handling and/or cleaning up member state.</li>
</ul>
<p>Data is shared between the three methods through data members of the subclass. Doing this has an additional advantage:
we are forced to write code that uses the already present data members, like m_w (weights) and bias of CLinearMachine.
The state is hence updated automatically every iteration, which keeps the model “current” while paused.<br />
<br />
train_machine calls init_model with the features pointer, which initializes the model parameters.
Next it calls continue_train, the new element of our class: it holds the while loop
that runs until convergence or the maximum number of iterations, calling the subclass’s iteration method again and again.<br />
<br />
When the user decides to prematurely cancel the computation, control is returned to the caller. The user can now do whatever is needed:
for example, serialize the incomplete model for comparisons or apply the model to some test data, and then simply call
continue_train to resume training (see the sketch below).<br />
<br />
The pull request includes a test in Perceptron that shows the general idea of how things work now.</p>
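<p>Putting it together, the mixin looks roughly like this (a simplified, self-contained sketch, not the exact shogun code; the member names mirror the post, everything else is a stand-in):</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">#include &lt;iostream&gt;

template &lt;typename Base&gt;
class IterativeMachine : public Base
{
public:
    bool train()
    {
        init_model();
        return continue_train();
    }

    bool continue_train()
    {
        // the real mixin also runs the computation controllers here,
        // which hand control back to the user on CTRL+C
        while (!m_complete &amp;&amp; m_current_iteration &lt; m_max_iteration)
        {
            iteration();
            ++m_current_iteration;
        }
        end_training();
        return m_complete;
    }

protected:
    virtual void init_model() = 0;
    virtual void iteration() = 0;
    virtual void end_training() {}

    int m_current_iteration = 0;
    int m_max_iteration = 100;
    bool m_complete = false;
};

struct LinearMachine
{
    double m_w = 0; // weights live in the base class
};

struct ToyPerceptron : IterativeMachine&lt;LinearMachine&gt;
{
    void init_model() override { m_w = 0; }
    void iteration() override
    {
        m_w += 0.1; // state is updated every iteration, so a paused
                    // model is always "current"
        if (m_w &gt; 1.0)
            m_complete = true;
    }
};

int main()
{
    ToyPerceptron p;
    std::cout &lt;&lt; (p.train() ? "converged" : "stopped early") &lt;&lt; "\n";
}</code></pre></figure>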
<h3 id="contributions">Contributions</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4335">Iterative Machine</a></p>
<p><a href="http:/blog/weekly%20updates/Post-8">Week 6</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on June 25, 2018.</p>http:/blog/weekly%20updates/Post-72018-06-18T00:00:00+00:002018-06-18T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>Phase one of GSoC is over this week. We have had a terrific run.</p>
<p>In this post we will look at how we are going to implement our IterativeMachine class and a few issues we faced
along with how we tackled them.</p>
<p>The class overrides train_machine, which calls a virtual method init_model that performs everything needed by the main training
loop. Communication between the training loop and init_model is done through member variables.
The next component is the continue_train method. This is where we have our COMPUTATION_CONTROLLERS macro and the main training loop,
which runs the single iteration implemented in the model and updates state after each iteration.</p>
<p>One issue was how to deal with features and labels. For labels we used the already present member in CMachine.
For features we added a new m_continue_features member as an extra. This is intended to keep things between IterativeMachine
and base classes like LinearMachine apart.</p>
<p>For now we only have an IterativeLinearMachine class that inherits from LinearMachine, and we implemented
the idea in Perceptron. We will build on it this week.</p>
<p>For end-of-training ops like cleaning/resetting state, warnings, errors etc. we have an end_training() method that can be overridden
in the subclass.</p>
<p>The final problem is where to place the IterativeMachine class in the inheritance ladder. The obvious solution appears
to be multiple inheritance, but we cannot do that since there would be some function overloading involved and it proved tough.
Another option is to implement IterativeMachine as a mixin for machines. A mixin is a class that inherits from
another class chosen dynamically through a template argument. It is not a standalone class but adds more things to a base class.
The result is a custom base class, exactly what we want. This keeps the iterative framework minimal. We might not need the extra
feature member anymore. Also the class will keep up with all API changes we introduce later, because it inherits almost everything
from the original base classes.</p>
<p>I have implemented this as a work in progress and it is working nicely. I have issues using it in interfaces and that is something
we will be looking into this week.</p>
<p>Another thing I worked on this week is a benchmark for “put” using Perceptron; it gave mixed results and more digging is required.</p>
<h3 id="contributions">Contributions</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4335">Iterative Machine</a></p>
<p><a href="http:/blog/weekly%20updates/Post-7">Week 5-End of Phase 1</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on June 18, 2018.</p>http:/blog/weekly%20updates/Post-62018-06-11T00:00:00+00:002018-06-11T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we dug deeply into the implementation of our framework and made a few changes.</p>
<p>After merging <a href="https://github.com/shogun-toolbox/shogun/pull/4320">#4320</a> the Perceptron is ready to implement an on_pause_impl()
of its own.</p>
<p>To start this off I wrote some simple code to serialize a machine whenever the user chooses pause after pressing CTRL+C.
The idea here was to simply allow a user to serialize the model into a CFile* member of the StoppableObject class. If the user wanted to
do something else on pause he could override the on_pause_impl() method and do it.</p>
<p>This had a few problems:</p>
<ul>
<li>Most of the time the file member will remain unused, which is bad design.</li>
<li>Problems with file_name and overwriting of files.</li>
<li>Tests: we had earlier worked on reusing the serialization tests and including a test for whether a model is stoppable.
However, we also need a test to check whether a model updates its state properly in each iteration. We came up with one, but it
still had its own limitations.</li>
<li>Interfaces like Python cannot directly overload the on_pause_impl() method. The user has no choice but to write C++ code if he wants to do something else on pause.</li>
</ul>
<p>With this in mind, the current approach needed more thought. So we decided to introduce the IterativeMachine class to shogun.
This will allow the user to simply cancel computation, perform whatever is needed on the intermediate model and then,
if desired, continue training again.
This approach solves all of the problems above.</p>
<p>We no longer need to “predict” and prepare for what the user might ideally want to do, so there is no need for extra members in StoppableObject.
Any issues with filenames are now directly under the user’s control, which makes it all a lot more transparent.</p>
<p>Implementing this means changing all iterative algorithms so that they only have a method to run a single iteration,
rather than the whole thing at once. A state that is updated in each iteration is then almost necessary for the model to train successfully,
so we don’t need complicated tests for that anymore.</p>
<p>But the best problem this approach solves is the easy flow of code into the interfaces. Earlier we were looking into using director classes to allow interfaces to
overload a method, but that feels like overkill.
Now all the user in Python needs to do is stop the training process with CTRL+C, perform whatever is needed, then call continue_train() again.
Much simpler to implement and more flexible.</p>
<h3 id="contributions">Contributions</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4330">helper to serialize machine to ascii</a><br /></p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4335">Iterative Machine</a></p>
<p><a href="http:/blog/weekly%20updates/Post-6">Week 4</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on June 11, 2018.</p>http:/blog/weekly%20updates/Post-52018-06-04T00:00:00+00:002018-06-04T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we looked into how the algorithms fail to update their state properly in each iteration.
Because of this we cannot expect meaningful information during a pause, since the model will just return its initial values.
This needs to be fixed for each algorithm.</p>
<p>That is, in each iteration the algorithm needs to update its state so that whenever we pause we get the current values.
We have implemented this for Perceptron.</p>
<p>Along the way we also wrote a test for hyperparameter initialization of the model as a jump start.
This will need to be addressed for all iterative algorithms.</p>
<p>I also refactored the trained-model serialization test to include a StoppableObject test that implements a basic check of an algorithm’s
iterative nature.
This led to interesting results. First, we cannot use this approach for algorithms that do not need a second iteration to converge; we will need to skip
these algorithms in the tests for now.
Second, we need to use break instead of continue in the COMPUTATION_CONTROLLERS macro.
The continue version turned training loops into infinite loops, as the loop condition is never updated and
the while statement never terminates (see the snippet below).</p>
<p>This was an easy fix but troublesome to find.</p>
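<p>The infinite loop is easy to see in a stripped-down version of such a training loop (illustrative):</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">#include &lt;atomic&gt;

std::atomic&lt;bool&gt; cancel_requested{false}; // set from the signal handler

void train_loop(int max_iterations)
{
    int current = 0;
    while (current &lt; max_iterations)
    {
        // conceptually what the macro expands to: with `continue` the
        // increment below is skipped, the loop condition never changes
        // once cancellation is requested, and the loop spins forever;
        // `break` leaves the loop immediately
        if (cancel_requested)
            break;

        // iteration() would run here
        ++current;
    }
}

int main()
{
    train_loop(100);
}</code></pre></figure>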
<p>Another task this week was a nice refactor of the progress bar code that changes how it is used all over shogun.
We now have a prefix like class_name::method_name by default.</p>
<p>This was done by instantiating the progress bar from a macro and using the <code class="highlighter-rouge">__FUNCTION__</code> macro to obtain the function name.</p>
<p>We now have much broader questions to ask, like what constitutes the “state” of a model and how we plan to update it each iteration.</p>
<p>This week we also found a new issue in garbage collection with the factory API… or rather the absence of it. I tried adding
%newobject to have swig take ownership of the newly created factory objects but it is not working. We will need to investigate further.</p>
<h3 id="contributions">Contributions:</h3>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4305">Progress bar in iterative algorithms</a>.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4327">Pausing Unit Test</a>.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4322">Garbage collection in swig</a>.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4320">State Update in Perceptron and unit test</a>.</p>
<p><a href="http:/blog/weekly%20updates/Post-5">Week 3</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on June 04, 2018.</p>http:/blog/weekly%20updates/Post-42018-05-29T00:00:00+00:002018-05-29T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>This week we came up with an idea to easily test Iterative Machines.</p>
<p>The idea here was to provide callback methods that will trigger cancel_computation(). These callbacks will directly send a block
signal to the signal handler and we are done.</p>
<p>This is also nice functionality to have, since a user might want to trigger pause/cancel when some condition is satisfied.
This makes the framework more user friendly, along with making tests a lot easier (a sketch of the idea is below).</p>
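<p>Conceptually the callback mechanism looks like this (a sketch with hypothetical signatures, not the exact StoppableSGObject API):</p>
<figure class="highlight"><pre><code class="language-c--" data-lang="c++">#include &lt;functional&gt;
#include &lt;iostream&gt;
#include &lt;utility&gt;

// the machine polls a user-supplied predicate once per iteration and
// cancels computation as soon as it returns true
class StoppableToy
{
public:
    void set_callback(std::function&lt;bool()&gt; cb)
    {
        m_callback = std::move(cb);
    }

    void train()
    {
        for (int i = 0; i &lt; 1000; ++i)
        {
            if (m_callback &amp;&amp; m_callback())
            {
                std::cout &lt;&lt; "cancelled at iteration " &lt;&lt; i &lt;&lt; "\n";
                return; // cancel_computation() would trigger here
            }
            // one training iteration would run here
        }
    }

private:
    std::function&lt;bool()&gt; m_callback;
};

int main()
{
    StoppableToy machine;
    int n = 0;
    // e.g. in a test: stop after five iterations
    machine.set_callback([&amp;n] { return ++n &gt;= 5; });
    machine.train();
}</code></pre></figure>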
<p>Currently we use jinja2 to write tests that systematically cover a bunch of algorithms, but jinja2 was being dropped from the serialization tests.
We could use it here, because the test we designed takes an algorithm, prematurely stops it with a callback on the number of iterations,
serializes the model, and then compares results after deserializing it.
What we test here is the fact that the algorithm is stoppable. We base this on whether the model triggered the callback, i.e. whether it had an
iteration during the training phase. This can serve as a test for whether the model is iterative in nature as well.</p>
<p>As a cookbook patch I ported regression meta examples to the new API.</p>
<p>We also implemented the progress bar in all iterative algorithms too. This was a nice refactor that involved identifying a lot of iterative algorithms :).</p>
<h3 id="contributions">Contributions:</h3>
<hr />
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4305">Progress bar in iterative algorithms</a>.<br /></p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4310">Refactor regression meta</a>.<br /></p>
<p>By my mentor<br /></p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4293">Add set_callback() to StoppableSGObject</a>.<br /></p>
<p><a href="http:/blog/weekly%20updates/Post-4">Week 2</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on May 29, 2018.</p>http:/blog/weekly%20updates/Post-32018-05-21T00:00:00+00:002018-05-21T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>The first week of the coding period ends today and we have an idea of how we are going to test the premature stopping framework. What we have in mind is using a
registered callback along with an <code class="highlighter-rouge">addcallback</code> method that allows the user to attach a custom callback to an algorithm. As Toni told me, what we ideally want to do is
train a model, stop and serialize it, and then calculate the result. Next, we compare it with the result we get by deserializing the model we saved earlier, for consistency.</p>
<p>Obviously, doing this for all iterative algorithms separately is difficult, so we are going to write some <code class="highlighter-rouge">TYPED_TESTS</code>. These can run for a large number
of algorithm instances without us having to explicitly write them for each instance. This is intuitive because we want to test just the fact that the model is serialized consistently,
and the callback will remain the same for every instance. Toni has made some edits to the <code class="highlighter-rouge">CStoppableSGObject</code> class regarding this.</p>
<p>I will be working on this the following week.</p>
<p>We also ran into an issue because <code class="highlighter-rouge">LinearMachine</code> and <code class="highlighter-rouge">KernelMachine</code> were implementing their own version of train instead of using the base class version.
Because of this we were not able to write custom on_pause_impl() methods, so I made a patch for that.</p>
<p>We wrote code for the <code class="highlighter-rouge">StoppableSGObject</code> class last week and I implemented it in <code class="highlighter-rouge">CMachineEvaluation</code>.</p>
<p>Next we took a look at removing the direct calls to <code class="highlighter-rouge">cancel_computation()</code> and replacing them with the <code class="highlighter-rouge">CANCEL_COMPUTATION</code> macro. During this we ran into an issue with using
the macro in const train methods.</p>
<p>The log-det refactoring was finally completed this week. Cheers to that! The final thing we did was make the code more memory efficient while preserving thread safety.
We used a boolean flag that defaults to false: we set it to true once we have negated the shifts and just let it be false otherwise. This is a simple hack and saves a lot of
memory, because earlier we were allocating a vector and then allocating another one all over again in the next iteration.</p>
<p>The final thing I did this week was come up with a list of iterative algorithms. This meant going through a lot of code and finding classes with some
logical iterative implementation. First I found a list of all algorithms with CMachine as base class. From there I manually went through all the for/while loops and
decided whether each will need to be visited later in this project, noting the line number where the loop starts. The list is not complete yet, but we have enough to start our work.
It was added as a wiki page to shogun. I tried to keep the list short by writing only base classes, like NeuralNetworks/EMBase; we will need to visit everything that uses them when
we start implementing custom pause/cancel behaviours.</p>
<h3 id="pull-requests">Pull Requests:</h3>
<hr />
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4286">#4286 using CANCEL_COMPUTATION macro</a><br />
replacing <code class="highlighter-rouge">cancel_computation()</code> with the macro.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4291">#4291 refactor CMachineEvaluation</a><br />
removing duplicate code from the class and inheriting from <code class="highlighter-rouge">CStoppableSGObject</code>.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4235">#4235 parallel computation of log-det</a><br />
making all methods called within <code class="highlighter-rouge">estimator.sample</code> const, along with parallelizing it.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4287">#4287 connect LinearMachine and KernelMachine to signal Handler</a><br />
making them use the train method of the base class.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/wiki/List-of-iterative-algorithms">List of iterative Algorithms</a><br /></p>
<p><a href="http:/blog/weekly%20updates/Post-3">Week 1</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on May 21, 2018.</p>http:/blog/weekly%20updates/Post-22018-05-13T00:00:00+00:002018-05-13T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>The <code class="highlighter-rouge">Community Bonding Period</code> is over today and tomorrow the <code class="highlighter-rouge">Coding Period</code> starts. In this post I will go through a summary of my
Community Bonding.</p>
<p>The first meeting with both of my mentors was very helpful and we made a decision to add a few more things to the work we needed to do.</p>
<p>First was an interface class that would later enable us to scale the Premature Stopping code to classes not inheriting from <code class="highlighter-rouge">CMachine</code>. This basically takes all our code
and places it in a nice new <code class="highlighter-rouge">StoppableSGObject</code> class.</p>
<p>My mentor’s experience proved really helpful in making this easy for me!</p>
<p>Another issue was a reliable testing mechanism for premature stopping. This is still in open discussion and we will be completing it soon.</p>
<p>As a Community Bonding exercise, all of the new students were required to help out with the new release by porting some meta examples to the new API and also translating some
undocumented ones to meta.</p>
<p>We have also decided to get rid of <code class="highlighter-rouge">CLabelsFactory</code> in favor of using ‘as’ all over shogun. This was a nice refactoring patch.</p>
<p>Overall this week was terrific for me. I got to work on a lot of cool things. The mentors are very helpful and we would be doing a lot more great stuff this summer.</p>
<h3 id="pull-requests">Pull Requests:</h3>
<hr />
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4236">#4236 factory methods in LDA meta example</a><br />
Refactors the <code class="highlighter-rouge">lda meta example</code> to use factory methods.<br />
Refactors LDA to work with <code class="highlighter-rouge">Dense</code> and <code class="highlighter-rouge">Multiclass</code> Labels instead of just Binary.<br />
Refactors the LDA unit tests by removing repeated code and also using <code class="highlighter-rouge">some</code> all over.<br /></p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4277">#4277 Delete CLabelsFactory Part1</a><br />
<a href="https://github.com/shogun-toolbox/shogun/pull/4281">#4281 Delete CLabelsFactory Part2</a><br />
Deletes <code class="highlighter-rouge">CLabelsFactory</code> in favor of using <code class="highlighter-rouge">as</code> in label conversions.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4280">#4280 CStoppableSGObject class</a><br />
Implements the new <code class="highlighter-rouge">CStoppableSGObject</code> base class.</p>
<p><a href="https://github.com/shogun-toolbox/shogun/pull/4278">#4278 distance meta examples</a><br />
Ported a few distance legacy python examples to meta.<br /></p>
<p><a href="http:/blog/weekly%20updates/Post-2">Community Bonding Period</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on May 13, 2018.</p>http:/blog/weekly%20updates/Post-12018-05-01T00:00:00+00:002018-05-01T00:00:00+00:00Shubham Shuklahttp:/blogshubhamshukla1197@gmail.com
<p>The 14th Google Summer of Code results were announced today and I am thrilled to be working with the people at Shogun.</p>
<p>My proposal for the project <em><a href="https://summerofcode.withgoogle.com/projects/6010966421012480">Inside the BlackBox</a></em> was accepted.
The major goal I am going to be working on is using premature stopping all over shogun’s codebase.</p>
<p>The backbone for this was beautifully written by my mentor Giovanni De Toni last year.
This interests me because debugging ease is something everyone is excited about for the obvious reason that debugging can be frustrating.
In machine learning, our models and algorithms do seem like black boxes with an input plugged in and generating meaningful output, especially when we do it in
other languages through wrappers.</p>
<p>What we want to achieve is to take the user along with the algorithm, so that he can see what the algorithm is seeing and then
make better, more informed decisions.</p>
<p>The first step is to map out my domain for this task, namely what I am going to touch and how.
This involves frisking all of shogun’s algorithms and finding the ones we care about. Thankfully someone (it was Ken Thompson, cheers to him) created grep.
We will be getting familiar with a large number of algorithms and deciding what is meaningful for each of them.</p>
<p>Aside from this I will also be working to complete the transition to factory methods in the meta examples. The new API that our mentors have developed piece by piece is definitely interesting.
Let’s take all that love and give it a place to live in our meta examples.
This week I am going to try to get my already open PRs merged. I have my final semester exams from next week and they will definitely eat
up most of my time for now.</p>
<p>We are going to have an IRC session soon enough to discuss how everything is really going to happen.
Interacting with my mentors is definitely something I have been looking forward to since I first found shogun a few months ago.
I am most interested in how they want to go about the project. This will give me a professional opinion that I am ready to
understand and work with.</p>
<p>I am also reading previous GSoC blog posts at Shogun, numFOCUS and other organisations to get a feel for how this summer is going to unfold.
This is a pretty interesting and fun thing to do. It’s always a great time trying to dive into the minds of the people you are going to be working with. :D</p>
<p><a href="http:/blog/weekly%20updates/Post-1">The proposal accepted</a> was originally published by Shubham Shukla at <a href="http:/blog">NOTEPAD</a> on May 01, 2018.</p>