Deeplearning4j: Performs convolutions over the embedded word vectors using multiple filter sizes

Created on 29 Jan 2016  Â·  5Comments  Â·  Source: eclipse/deeplearning4j

Something similar to
http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/
and the paper "Convolutional Neural Networks for Sentence Classification" link
http://arxiv.org/abs/1408.5882

Some descriptions to make it more clear.
"The first layers embeds words into low-dimensional vectors. The next layer performs convolutions over the embedded word vectors using multiple filter sizes. For example, sliding over 3, 4 or 5 words at a time. Next, we max-pool the result of the convolutional layer into a long feature vector, add dropout regularization, and classify the result using a softmax layer."

Thanks.

Enhancement

Most helpful comment

Right. That's what I was mentioning earlier. Mainly flagging for later.

On Fri, Jan 29, 2016 at 4:28 PM, Alex Black [email protected]
wrote:

Just a heads up: this sort of architecture should be relatively
straightforward with the computation graph architecture we've been working
on. The idea with that is that you define a network using an arbitrary
directed acyclic graph connection structure, instead of just a stack of
layers as in MultiLayerNetwork.
This should be merged to master soon.

—
Reply to this email directly or view it on GitHub
https://github.com/deeplearning4j/deeplearning4j/issues/1088#issuecomment-177030604
.

All 5 comments

Just a heads up: this sort of architecture should be relatively straightforward with the computation graph architecture we've been working on. The idea with that is that you define a network using an arbitrary directed acyclic graph connection structure, instead of just a stack of layers as in MultiLayerNetwork.
This should be merged to master soon.

Right. That's what I was mentioning earlier. Mainly flagging for later.

On Fri, Jan 29, 2016 at 4:28 PM, Alex Black [email protected]
wrote:

Just a heads up: this sort of architecture should be relatively
straightforward with the computation graph architecture we've been working
on. The idea with that is that you define a network using an arbitrary
directed acyclic graph connection structure, instead of just a stack of
layers as in MultiLayerNetwork.
This should be merged to master soon.

—
Reply to this email directly or view it on GitHub
https://github.com/deeplearning4j/deeplearning4j/issues/1088#issuecomment-177030604
.

Addressed with comp graph

Hi,
is there any code example for this particular problem you could share?
thanks,
ahmet

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

AlexDBlack picture AlexDBlack  Â·  4Comments

taraliza picture taraliza  Â·  6Comments

turambar picture turambar  Â·  3Comments

atuzhykov picture atuzhykov  Â·  4Comments

AlexDBlack picture AlexDBlack  Â·  4Comments