Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

Overview

CNTK

Chat Windows build status Linux build status
Join the chat at https://gitter.im/Microsoft/CNTK Build Status Build Status

The Microsoft Cognitive Toolkit (https://cntk.ai) is a unified deep learning toolkit that describes neural networks as a series of computational steps via a directed graph. In this directed graph, leaf nodes represent input values or network parameters, while other nodes represent matrix operations upon their inputs. CNTK allows users to easily realize and combine popular model types such as feed-forward DNNs, convolutional nets (CNNs), and recurrent networks (RNNs/LSTMs). It implements stochastic gradient descent (SGD, error backpropagation) learning with automatic differentiation and parallelization across multiple GPUs and servers. CNTK has been available under an open-source license since April 2015. It is our hope that the community will take advantage of CNTK to share ideas more quickly through the exchange of open source working code.

Installation

Installing nightly packages

If you prefer to use latest CNTK bits from master, use one of the CNTK nightly packages:

Learning CNTK

You can learn more about using and contributing to CNTK with the following resources:

More information

Disclaimer

Dear community,

With our ongoing contributions to ONNX and the ONNX Runtime, we have made it easier to interoperate within the AI framework ecosystem and to access high performance, cross-platform inferencing capabilities for both traditional ML models and deep neural networks. Over the last few years we have been privileged to develop such key open-source machine learning projects, including the Microsoft Cognitive Toolkit, which has enabled its users to leverage industry-wide advancements in deep learning at scale.

Today’s 2.7 release will be the last main release of CNTK. We may have some subsequent minor releases for bug fixes, but these will be evaluated on a case-by-case basis. There are no plans for new feature development post this release.

The CNTK 2.7 release has full support for ONNX 1.4.1, and we encourage those seeking to operationalize their CNTK models to take advantage of ONNX and the ONNX Runtime. Moving forward, users can continue to leverage evolving ONNX innovations via the number of frameworks that support it. For example, users can natively export ONNX models from PyTorch or convert TensorFlow models to ONNX with the TensorFlow-ONNX converter.

We are incredibly grateful for all the support we have received from contributors and users over the years since the initial open-source release of CNTK. CNTK has enabled both Microsoft teams and external users to execute complex and large-scale workloads in all manner of deep learning applications, such as historical breakthroughs in speech recognition achieved by Microsoft Speech researchers, the originators of the framework.

As ONNX is increasingly employed in serving models used across Microsoft products such as Bing and Office, we are dedicated to synthesizing innovations from research with the rigorous demands of production to progress the ecosystem forward.

Above all, our goal is to make innovations in deep learning across the software and hardware stacks as open and accessible as possible. We will be working hard to bring both the existing strengths of CNTK and new state-of-the-art research into other open-source projects to truly broaden the reach of such technologies.

With gratitude,

-- The CNTK Team

Microsoft Open Source Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

News

You can find more news on the official project feed

2019-03-29. CNTK 2.7.0

Highlights of this release

  • Moved to CUDA 10 for both Windows and Linux.
  • Support advance RNN loop in ONNX export.
  • Export larger than 2GB models in ONNX format.
  • Support FP16 in Brain Script train action.

CNTK support for CUDA 10

CNTK now supports CUDA 10. This requires an update to build environment to Visual Studio 2017 v15.9 for Windows.

To setup build and runtime environment on Windows:

To setup build and runtime environment on Linux using docker, please build Unbuntu 16.04 docker image using Dockerfiles here. For other Linux systems, please refer to the Dockerfiles to setup dependent libraries for CNTK.

Support advance RNN loop in ONNX export

CNTK models with recursive loops can be exported to ONNX models with scan ops.

Export larger than 2GB models in ONNX format

To export models larger than 2GB in ONNX format, use cntk.Function API: save(self, filename, format=ModelFormat.CNTKv2, use_external_files_to_store_parameters=False) with 'format' set to ModelFormat.ONNX and use_external_files_to_store_parameters set to True. In this case, model parameters are saved in external files. Exported models shall be used with external parameter files when doing model evaluation with onnxruntime.

2018-11-26.
Netron now supports visualizing CNTK v1 and CNTK v2 .model files.

NetronCNTKDark1 NetronCNTKLight1

Project changelog

2018-09-17. CNTK 2.6.0

Efficient group convolution

The implementation of group convolution in CNTK has been updated. The updated implementation moves away from creating a sub-graph for group convolution (using slicing and splicing), and instead uses cuDNN7 and MKL2017 APIs directly. This improves the experience both in terms of performance and model size.

As an example, for a single group convolution op with the following attributes:

  • Input tensor (C, H, W) = (32, 128, 128)
  • Number of output channels = 32 (channel multiplier is 1)
  • Groups = 32 (depth wise convolution)
  • Kernel size = (5, 5)

The comparison numbers for this single node are as follows:

First Header GPU exec. time (in millisec., 1000 run avg.) CPU exec. time (in millisec., 1000 run avg.) Model Size (in KB, CNTK format)
Old implementation 9.349 41.921 38
New implementation 6.581 9.963 5
Speedup/savings Approx. 30% Approx. 65-75% Approx. 87%

Sequential Convolution

The implementation of sequential convolution in CNTK has been updated. The updated implementation creates a separate sequential convolution layer. Different from regular convolution layer, this operation convolves also on the dynamic axis(sequence), and filter_shape[0] is applied to that axis. The updated implementation supports broader cases, such as where stride > 1 for the sequence axis.

For example, a sequential convolution over a batch of one-channel black-and-white images. The images have the same fixed height of 640, but each with width of variable lengths. The width is then represented by sequential axis. Padding is enabled, and strides for both width and height are 2.

 >>> f = SequentialConvolution((3,3), reduction_rank=0, pad=True, strides=(2,2), activation=C.relu)
 >>> x = C.input_variable(**Sequence[Tensor[640]])
 >>> x.shape
     (640,)
 >>> h = f(x)
 >>> h.shape
     (320,)
 >>> f.W.shape
     (1, 1, 3, 3)

Operators

depth_to_space and space_to_depth

There is a breaking change in the depth_to_space and space_to_depth operators. These have been updated to match ONNX specification, specifically the permutation for how the depth dimension is placed as blocks in the spatial dimensions, and vice-versa, has been changed. Please refer to the updated doc examples for these two ops to see the change.

Tan and Atan

Added support for trigonometric ops Tan and Atan.

ELU

Added support for alpha attribute in ELU op.

Convolution

Updated auto padding algorithms of Convolution to produce symmetric padding at best effort on CPU, without affecting the final convolution output values. This update increases the range of cases that could be covered by MKL API and improves the performance, E.g. ResNet50.

Default arguments order

There is a breaking change in the arguments property in CNTK python API. The default behavior has been updated to return arguments in python order instead of in C++ order. This way it will return arguments in the same order as they are fed into ops. If you wish to still get arguments in C++ order, you can simply override the global option. This change should only affect the following ops: Times, TransposeTimes, and Gemm(internal).

Bug fixes

  • Updated doc for Convolution layer to include group and dilation arguments.
  • Added improved input validation for group convolution.
  • Updated LogSoftMax to use more numerically stable implementation.
  • Fixed Gather op's incorrect gradient value.
  • Added validation for 'None' node in python clone substitution.
  • Added validation for padding channel axis in convolution.
  • Added CNTK native default lotusIR logger to fix the "Attempt to use DefaultLogger" error when loading some ONNX models.
  • Added proper initialization for ONNX TypeStrToProtoMap.
  • Updated python doctest to handle different print format for newer version numpy(version >= 1.14).
  • Fixed Pooling(CPU) to produce correct output values when kernel center is on padded input cells.

ONNX

Updates

  • Updated CNTK's ONNX import/export to use ONNX 1.2 spec.
  • Major update to how batch and sequence axes are handled in export and import. As a result, the complex scenarios and edge cases are handled accurately.
  • Updated CNTK's ONNX BatchNormalization op export/import to latest spec.
  • Added model domain to ONNX model export.
  • Improved error reporting during import and export of ONNX models.
  • Updated DepthToSpace and SpaceToDepth ops to match ONNX spec on the permutation for how the depth dimension is placed as block dimension.
  • Added support for exporting alpha attribute in ELU ONNX op.
  • Major overhaul to Convolution and Pooling export. Unlike before, these ops do not export an explicit Pad op in any situation.
  • Major overhaul to ConvolutionTranspose export and import. Attributes such as output_shape, output_padding, and pads are fully supported.
  • Added support for CNTK's StopGradient as a no-op.
  • Added ONNX support for TopK op.
  • Added ONNX support for sequence ops: sequence.slice, sequence.first, sequence.last, sequence.reduce_sum, sequence.reduce_max, sequence.softmax. For these ops, there is no need to expand ONNX spec. CNTK ONNX exporter just builds computation equivalent graphs for these sequence ops.
  • Added full support for Softmax op.
  • Made CNTK broadcast ops compatible with ONNX specification.
  • Handle to_batch, to_sequence, unpack_batch, sequence.unpack ops in CNTK ONNX exporter.
  • ONNX tests to export ONNX test cases for other toolkits to run and to validate.
  • Fixed Hardmax/Softmax/LogSoftmax import/export.
  • Added support for Select op export.
  • Added import/export support for several trigonometric ops.
  • Updated CNTK support for ONNX MatMul op.
  • Updated CNTK support for ONNX Gemm op.
  • Updated CNTK's ONNX MeanVarianceNormalization op export/import to latest spec.
  • Updated CNTK's ONNX LayerNormalization op export/import to latest spec.
  • Updated CNTK's ONNX PRelu op export/import to latest spec.
  • Updated CNTK's ONNX Gather op export/import to latest spec.
  • Updated CNTK's ONNX ImageScaler op export/import to latest spec.
  • Updated CNTK's ONNX Reduce ops export/import to latest spec.
  • Updated CNTK's ONNX Flatten op export/import to latest spec.
  • Added CNTK support for ONNX Unsqueeze op.

Bug or minor fixes:

  • Updated LRN op to match ONNX 1.2 spec where the size attribute has the semantics of diameter, not radius. Added validation if LRN kernel size is larger than channel size.
  • Updated Min/Max import implementation to handle variadic inputs.
  • Fixed possible file corruption when resaving on top of existing ONNX model file.

.Net Support

The Cntk.Core.Managed library has officially been converted to .Net Standard and supports .Net Core and .Net Framework applications on both Windows and Linux. Starting from this release, .Net developers should be able to restore CNTK Nuget packages using new .Net SDK style project file with package management format set to PackageReference.

The following C# code now works on both Windows and Linux:

 >>> var weightParameterName = "weight";
 >>> var biasParameterName = "bias";
 >>> var inputName = "input";
 >>> var outputDim = 2;
 >>> var inputDim = 3;
 >>> Variable inputVariable = Variable.InputVariable(new int[] { inputDim }, DataType.Float, inputName);
 >>> var weightParameter = new Parameter(new int[] { outputDim, inputDim }, DataType.Float, 1, device, weightParameterName);
 >>> var biasParameter = new Parameter(new int[] { outputDim }, DataType.Float, 0, device, biasParameterName);
 >>> 
 >>> Function modelFunc = CNTKLib.Times(weightParameter, inputVariable) + biasParameter;

For example, simply adding an ItemGroup clause in the .csproj file of a .Net Core application is sufficient: >>> >>> >>> >>> netcoreapp2.1 >>> x64 >>> >>> >>> >>> >>> >>> >>>

Bug or minor fixes:

  • Fixed C# string and char to native wstring and wchar UTF conversion issues on Linux.
  • Fixed multibyte and wide character conversions across the codebase.
  • Fixed Nuget package mechanism to pack for .Net Standard.
  • Fixed a memory leak issue in Value class in C# API where Dispose was not called upon object destruction.

Misc

2018-04-16. CNTK 2.5.1

Repack CNTK 2.5 with third party libraries included in the bundles (Python wheel packages)


2018-03-15. CNTK 2.5

Change profiler details output format to be chrome://tracing

Enable per-node timing. Working example here

  • per-node timing creates items in profiler details when profiler is enabled.
  • usage in Python:
import cntk as C
C.debugging.debug.set_node_timing(True)
C.debugging.start_profiler() # optional
C.debugging.enable_profiler() # optional
#<trainer|evaluator|function> executions
<trainer|evaluator|function>.print_node_timing()
C.debugging.stop_profiler()

Example profiler details view in chrome://tracing ProfilerDetailWithNodeTiming

CPU inference performance improvements using MKL

  • Accelerates some common tensor ops in Intel CPU inference for float32, especially for fully connected networks
  • Can be turned on/off by cntk.cntk_py.enable_cpueval_optimization()/cntk.cntk_py.disable_cpueval_optimization()

1BitSGD incorporated into CNTK

  • 1BitSGD source code is now available with CNTK license (MIT license) under Source/1BitSGD/
  • 1bitsgd build target was merged into existing gpu target

New loss function: hierarchical softmax

  • Thanks @yaochengji for the contribution!

Distributed Training with Multiple Learners

  • Trainer now accepts multiple parameter learners for distributed training. With this change, different parameters of a network can be learned by different learners in a single training session. This also facilitates distributed training for GANs. For more information, please refer to the Basic_GAN_Distributed.py and the cntk.learners.distributed_multi_learner_test.py

Operators

  • Added MeanVarianceNormalization operator.

Bug fixes

  • Fixed convergence issue in Tutorial 201B
  • Fixed pooling/unpooling to support free dimension for sequences
  • Fixed crash in CNTKBinaryFormat deserializer when crossing sweep boundary
  • Fixed shape inference bug in RNN step function for scalar broadcasting
  • Fixed a build bug when mpi=no
  • Improved distributed training aggregation speed by increasing packing threshold, and expose the knob in V2
  • Fixed a memory leak in MKL layout
  • Fixed a bug in cntk.convert API in misc.converter.py, which prevents converting complex networks.

ONNX

  • Updates
    • CNTK exported ONNX models are now ONNX.checker compliant.
    • Added ONNX support for CNTK’s OptimizedRNNStack operator (LSTM only).
    • Added support for LSTM and GRU operators
    • Added support for experimental ONNX op MeanVarianceNormalization.
    • Added support for experimental ONNX op Identity.
    • Added support for exporting CNTK’s LayerNormalization layer using ONNX MeanVarianceNormalization op.
  • Bug or minor fixes:
    • Axis attribute is optional in CNTK’s ONNX Concat operator.
    • Bug fix in ONNX broadcasting for scalars.
    • Bug fix in ONNX ConvTranspose operator.
    • Backward compatibility bug fix in LeakyReLu (argument ‘alpha’ reverted to type double).

Misc

  • Added a new API find_by_uid() under cntk.logging.graph.

2018-02-28. CNTK supports nightly build

If you prefer to use latest CNTK bits from master, use one of the CNTK nightly package.

Alternatively, you can also click corresponding build badge to land to nightly build page.


2018-01-31. CNTK 2.4

Highlights:

  • Moved to CUDA9, cuDNN 7 and Visual Studio 2017.
  • Removed Python 3.4 support.
  • Added Volta GPU and FP16 support.
  • Better ONNX support.
  • CPU perf improvement.
  • More OPs.

OPs

  • top_k operation: in the forward pass it computes the top (largest) k values and corresponding indices along the specified axis. In the backward pass the gradient is scattered to the top k elements (an element not in the top k gets a zero gradient).
  • gather operation now supports an axis argument
  • squeeze and expand_dims operations for easily removing and adding singleton axes
  • zeros_like and ones_like operations. In many situations you can just rely on CNTK correctly broadcasting a simple 0 or 1 but sometimes you need the actual tensor.
  • depth_to_space: Rearranges elements in the input tensor from the depth dimension into spatial blocks. Typical use of this operation is for implementing sub-pixel convolution for some image super-resolution models.
  • space_to_depth: Rearranges elements in the input tensor from the spatial dimensions to the depth dimension. It is largely the inverse of DepthToSpace.
  • sum operation: Create a new Function instance that computes element-wise sum of input tensors.
  • softsign operation: Create a new Function instance that computes the element-wise softsign of a input tensor.
  • asinh operation: Create a new Function instance that computes the element-wise asinh of a input tensor.
  • log_softmax operation: Create a new Function instance that computes the logsoftmax normalized values of a input tensor.
  • hard_sigmoid operation: Create a new Function instance that computes the hard_sigmoid normalized values of a input tensor.
  • element_and, element_not, element_or, element_xor element-wise logic operations
  • reduce_l1 operation: Computes the L1 norm of the input tensor's element along the provided axes.
  • reduce_l2 operation: Computes the L2 norm of the input tensor's element along the provided axes.
  • reduce_sum_square operation: Computes the sum square of the input tensor's element along the provided axes.
  • image_scaler operation: Alteration of image by scaling its individual values.

ONNX

  • There have been several improvements to ONNX support in CNTK.
  • Updates
    • Updated ONNX Reshape op to handle InferredDimension.
    • Adding producer_name and producer_version fields to ONNX models.
    • Handling the case when neither auto_pad nor pads atrribute is specified in ONNX Conv op.
  • Bug fixes
    • Fixed bug in ONNX Pooling op serialization
    • Bug fix to create ONNX InputVariable with only one batch axis.
    • Bug fixes and updates to implementation of ONNX Transpose op to match updated spec.
    • Bug fixes and updates to implementation of ONNX Conv, ConvTranspose, and Pooling ops to match updated spec.

Operators

  • Group convolution
    • Fixed bug in group convolution. Output of CNTK Convolution op will change for groups > 1. More optimized implementation of group convolution is expected in the next release.
    • Better error reporting for group convolution in Convolution layer.

Halide Binary Convolution

  • The CNTK build can now use optional Halide libraries to build Cntk.BinaryConvolution.so/dll library that can be used with the netopt module. The library contains optimized binary convolution operators that perform better than the python based binarized convolution operators. To enable Halide in the build, please download Halide release and set HALIDE_PATH environment varibale before starting a build. In Linux, you can use ./configure --with-halide[=directory] to enable it. For more information on how to use this feature, please refer to How_to_use_network_optimization.

See more in the Release Notes. Get the Release from the CNTK Releases page.

Comments
  • Regression Network with Multiple Outputs and UCI Fast Reader

    Regression Network with Multiple Outputs and UCI Fast Reader

    Hi, I am unable to get a regression network with multiple outputs to work using UCIFastReader. Is this possible?

    If I define a network with 2 output nodes like: SimpleNetworkBuilder = [ layerSizes = 12:2 ...

    and a UCIFastReader reader labels section as: labels = [ labelType = "regression" dim = 2 start = 12 labelMappingFile = "$DataDir$/label-mapping.txt" ]

    ...where label-mapping.txt is an empty file.

    When I run the train command it ends with the following: EXCEPTION occurred: NotifyFunctionValuesMBSizeModified: labels InputValue operation had its row dimension 2 changed by the reader to 1.

    If I change config to a network with one output node and change the reader section to dim = 1, then it works, so I think this somehow has to do with not being able to use the UCIFastReader for regression with multiple outputs, if that is the case - is there a way to do this with some different reader?

    area reader 
    opened by amirbegan 58
  • Iteration Plan (September - October 2017)

    Iteration Plan (September - October 2017)

    This plan captures our work from mid September to end of October. We will ship around November 22nd. Major work items of this iteration include ONNX support in CNTK, MKL integration, and many others.

    Endgame

    • November 8: Code freeze for the end game
    • November 22: Release date

    Planned items

    We plan to ship these items at the end of this iteration.

    Legend of annotations:

    | Icon | Description | |----------------- |----------------------------| |

  • [ ] | Item not started | |
  • [x] | Item finished | | 🏃 | Work in progress | | ✋ | Blocked | | 💪 | Stretch |

    Documentation

    • [ ] Finalize learner design and fix related documentation

    System

    • [x] Support import/export ONNX format models
    • [ ] A network optimization API that helps model compression via SVD, quantization, etc.
    • [ ] 16bit support for training on Volta GPU (limited functionality)
    • [ ] C# high-level API design (no implementation)
    • [ ] Reader improvement for large data sets (sequential reader)

    Examples

    • [ ] Faster R-CNN object detection
      • [ ] Clean up the code to use arbitrary input image size
      • [ ] C++ implementation of some Python layers
      • [ ] Usability improvement
    • [ ] New example for natural language processing (NLP)
    • [x] New tutorial on WGAN and LS-GAN
    • [ ] Semantic segmentation (stretch goal)

    Operations

    • [ ] Specify frequency in the number of epochs and minibatches for progress report, validation, checkpoints
    • [ ] Improve statistics for distributed evaluation

    Performance

    • [ ] Intel MKL update to improve inference speed on CPU by around 2x on AlexNet

    Others

    iteration plan 
  • opened by cha-zhang 49
  • Iteration Plan (August - September 2017)

    Iteration Plan (August - September 2017)

    This plan captures our work from early August to mid September. We will ship around September 15th. Major work items of this iteration include Volta 16bit support and C#/.NET API. There will also be numerous other improvements we will make as detailed below.

    Endgame

    • September 11: Code freeze for the end game
    • September 15: Release date

    Planned items

    We plan to ship these items at the end of this iteration.

    Legend of annotations:

    | Icon | Description | |----------------- |----------------------------| |

  • [ ] | Item not started | |
  • [x] | Item finished | | 🏃 | Work in progress | | ✋ | Blocked | | 💪 | Stretch |

    Documentation

    • [ ] Add HTML version of tutorials and manuals so that they can be searchable
    • [ ] Add missing evaluation documents

    System

    ✋ 16bit support for training on Volta GPU (limited functionality)

    • [ ] Update learner interface to simplify parameter setting and adding new learners (Potential breaking change)
    • [x] A preliminary C#/.NET API that enables people to train simple networks such as ConvNet on MNIST.
    • [ ] R-binding for training and evaluation (will be published in a separate repository) ✋ Improve statistics for distributed evaluation

    Examples

    • [ ] Faster R-CNN object detection
      • [ ] Enable arbitrary input image size via free static axis for convolution
      • [ ] C++ implementation of some Python layers
      • [ ] Usability improvement ✋ New example for natural language processing (NLP)
    • [ ] Semantic segmentation (stretch goal)

    Operations

    • [x] Noise contrastive estimation node
    • [ ] Aggregation on sparse gradient for embedded layer
    • [ ] Gradient as an operator (stretch goal)
    • [ ] Reduced rank for convolution in C++ to enable convolution on 1D data
    • [x] Dilated convolution

    Performance

    • [ ] Asynchronous evaluation API (Python and C#) ✋ Intel MKL update to improve inference speed on CPU by around 2x on AlexNet

    Keras and Tensorboard

    • [ ] Example on Keras and SKLearn multi-GPU support on CNTK
    • [ ] Image feature support with Tensorboard for CNTK

    Others

    iteration plan 
  • opened by cha-zhang 40
  • ND convolution and pooling

    ND convolution and pooling

    Hi all, thanks again for releasing this very powerful toolkit. I read the doc and run the MNIST example. In this example the "mnist_convert.py" resizes the 2d digit images in a feature vector of length 28x28. How can I implement something similar for 3d datasets? Would it work out of the box if I transform my data in a feature vector of length NX x NY x NZ? Is 3d convolution already supported?

    area samples area documentation 
    opened by Madgeeno 31
  • How to predict my images using trained-model file??

    How to predict my images using trained-model file??

    I am using ImageNet-sample python code.

    My main code is as follows,

    modelpath = 'BN-Inception_99.model'
    network = create_bn_inception_by_exist_model(modelpath)
    trainer = create_trainer(network, epoch_size, max_epochs, minibatch_size)
    train_source = create_image_mb_source(train_data, mean_data, True, total_number_of_samples=max_epochs * epoch_size)
    test_source = create_image_mb_source(test_data, mean_data, False, total_number_of_samples=FULL_DATA_SWEEP)
    train_re_source = create_image_mb_source(train_re_data, mean_data, False, total_number_of_samples=FULL_DATA_SWEEP)
    

    prediction_TRAIN_AND_TEST_existingmodel(network, trainer, train_source, test_source, train_re_source, progress_printer, max_epochs, minibatch_size, epoch_size, restore, profiler_dir, testing_parameters)

    And, "create_bn_inception_by_exist_model" function is this. def create_bn_inception_by_exist_model(modelpath):

    # Input variables denoting the features and label data
    feature_var = input_variable((NUM_CHANNELS, IMAGE_HEIGHT, IMAGE_WIDTH))
    label_var = input_variable((NUM_CLASSES))
    
    bn_time_const = 4096
    z = load_model(modelpath)
    
    # loss and metric
    ce  = cross_entropy_with_softmax(z, label_var)
    pe  = classification_error(z, label_var)
    pe5 = classification_error(z, label_var, topN=5)
    
    log_number_of_parameters(z)
    print()
    
    return {
        'feature': feature_var,
        'label'  : label_var,
        'ce'     : ce,
        'pe'     : pe,
        'pe5'    : pe5,
        'output' : z
    }
    

    And, "prediction_TRAIN_AND_TEST_existingmodel" function is this. def prediction_TRAIN_AND_TEST_existingmodel(network, trainer, train_source, test_source, train_re_source, progress_printer, max_epochs, minibatch_size, epoch_size, restore, profiler_dir, testing_parameters):

    # define mapping from intput streams to network inputs
    input_map = {
        network['feature']: train_source.streams.features,
        network['label']: train_source.streams.labels
    }
    # process minibatches and evaluate the model
    metric_numer    = 0
    metric_denom    = 0
    sample_count    = 0
    minibatch_index = 0
    
    top_k = 5    # top-5
    
    while sample_count < epoch_size:
        current_minibatch = min(minibatch_size, epoch_size - sample_count)
        # Fetch next test min batch for train data.
        data = train_re_source.next_minibatch(current_minibatch, input_map=input_map)
        # minibatch data to be trained with
        metric_numer += trainer.test_minibatch(data) * current_minibatch      ###### <- error code
    
        ## added edward.cho
        out = cntk.softmax(network['output'])
        print ("out :", out)
        predicted_label_probs = out.eval(data)
    

    BTW, I met some error(###### <- error code) in "prediction_TRAIN_AND_TEST_by_existingmodel" function.

    This is error message. metric_numer += trainer.test_minibatch(data) * current_minibatch File "/home/mirero/anaconda2/lib/python2.7/site-packages/cntk/train/trainer.py", line 223, in test_minibatch return super(Trainer, self).test_minibatch(arguments, device) File "/home/mirero/anaconda2/lib/python2.7/site-packages/cntk/cntk_py.py", line 2466, in test_minibatch return _cntk_py.Evaluator_test_minibatch(self, *args) ValueError: Values for 1 required arguments 'Input('Input3', [#], [3 x 224 x 224])', that the requested output(s) 'Output('aggregateEvalMetric', [], []), Output('Block4979_Output_0', [#], [1])' depend on, have not been provided.

    How should I do modify this code??

    opened by edwardcho 30
  • ValueError: Specified GPU device id (0) is invalid

    ValueError: Specified GPU device id (0) is invalid

    Hi, i have built CNTK GPU successfully, and i can run the FeedForwardNet.py example. However when i try to run the same example using GPU (try_set_default_device(cntk.device.gpu(0))) I get the following error :

    Traceback (most recent call last): File "FeedForwardNet.py", line 82, in try_set_default_device(cntk.device.gpu(0)) File "/CNTK/bindings/python/cntk/internal/swig_helper.py", line 69, in wrapper result = f(*args, **kwds) File "/CNTK/bindings/python/cntk/device.py", line 94, in gpu return cntk_py.DeviceDescriptor.gpu_device(device_id) ValueError: Specified GPU device id (0) is invalid.

    Build info:

    	Built time: May  1 2017 22:44:50
    	Last modified date: Mon May  1 12:04:32 2017
    	Build type: release
    	Build target: GPU
    	With 1bit-SGD: no
    	With ASGD: no
    	Math lib: mkl
    	CUDA_PATH: /usr/local/cuda-8.0/
    	CUB_PATH: /usr/local/cub-1.4.1
    	CUDNN_PATH: /usr/local/cudnn-5.1
    	Build Branch: master
    	Build SHA1: 4b9f8739c72068d70279f91b4b59923b2ae1fc3a (modified)
    

    +-----------------------------------------------------------------------------+ | NVIDIA-SMI 375.39 Driver Version: 375.39 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===============================+======================+======================| | 0 GeForce GTX 675M Off | 0000:01:00.0 N/A | N/A | | 0% 59C P0 N/A / N/A | 357MiB / 1984MiB | N/A Default | +-------------------------------+----------------------+----------------------+

    +-----------------------------------------------------------------------------+ | Processes: GPU Memory | | GPU PID Type Process name Usage | |=============================================================================| | 0 Not Supported | +-----------------------------------------------------------------------------+

    Anyone has faced this issue ? Thanks.

    opened by youssefhb 27
  • FastRCNN with CSEvalClient Rois question

    FastRCNN with CSEvalClient Rois question

    Hi,

    I'm working on CSEvalClient and I some questions regarding this piece of code:

    EvaluateObjectDetectionModel(); https://github.com/Microsoft/CNTK/commit/7eefb18dce9a6f8bf4e020f0dde37acb403e16dd

    1. This rois are the selective search rois from the original image projected to the resize image(1000x1000) and divide by 1000 to get the float values, right? The original image is WIN_20160803_11_28_42_Pro.jpg with 1080x1920

    // parse rois: groups of 4 floats corresponding to (x, y, w, h) for an ROI string roiCoordinates = "0.219 0.0 0.165 0.29 0.329 0.025 0.07 0.115 0.364 0.0 0.21 0.13 …....

    1. Where can I get the values of the rois obtain by?

    outputs = model.Evaluate(inputs, outDims.First().Key);

    “Outcome for ROI 31: 6 (gerkin) …”
    
    I want the values left,top,width,heigth of ROI number 31!
    

    Thks

    opened by pmfcdb 27
  • MPI error while compiling

    MPI error while compiling

    Hello,

    I'm facing an issue while I compile CNTK with MPI on a Ubuntu 14.04 machine. Here the steps I did:

    ../../configure --1bitsgd=yes
    Defaulting to --with-buildtype=release
    Found cuda at /usr/local/cuda-7.5
    Found gdk at /usr/.
    Found CUB at /usr/local/cub-1.4.1
    Found cuDNN at /usr/local
    Found OpenCV at /usr/local/opencv-3.0.0
    Cannot locate libzip files
    ImageReader will be built without zip container support.
    Generating /home/plu/git/CNTK/build/release/Config.make
    Generating /home/plu/git/CNTK/build/release/Makefile
    run
    >make -j all
    to build
    make all
    

    The error during the make is the following:

    creating /home/plu/git/CNTK/build/release/.build/Source/SGDLib/SGD.o for with build type release
    mpic++ -c Source/SGDLib/SGD.cpp -o /home/plu/git/CNTK/build/release/.build/Source/SGDLib/SGD.o -D_POSIX_SOURCE -D_XOPEN_SOURCE=600 -D__USE_XOPEN2K -std=c++11 -DUSE_CUDNN -DUSE_ACML -DNDEBUG -DNO_SYNC -DQUANTIZED_GRADIENT_AGGREGATION  -msse3 -std=c++0x -fopenmp -fpermissive -fPIC -Werror -fcheck-new -Wno-error=literal-suffix -g -O4 -ISource/Common/Include -ISource/Math -ISource/CNTK -ISource/ActionsLib -ISource/ComputationNetworkLib -ISource/SGDLib -ISource/SequenceTrainingLib -ISource/CNTK/BrainScript -ISource/Readers/ReaderLib -I/usr/./include/nvidia/gdk -I/usr/local/cub-1.4.1 -I/usr/local/cuda-7.5/include -I/usr/local/cuda/include -I/usr/local/acml5.3.1/ifort64_mp/include -I/usr/local/opencv-3.0.0/include -ISource/1BitSGD -MD -MP -MF /home/plu/git/CNTK/build/release/.build/Source/SGDLib/SGD.d
    Source/SGDLib/SGD.cpp: In instantiation of 'void Microsoft::MSR::CNTK::SGD<ElemType>::InitDistGradAgg(int, int) [with ElemType = float]':
    Source/SGDLib/SGD.cpp:2283:16:   required from here
    Source/SGDLib/SGD.cpp:1846:27: error: no matching function for call to 'Microsoft::MSR::CNTK::AllReduceDistGradAggregator<float>::AllReduceDistGradAggregator(std::shared_ptr<Microsoft::MSR::CNTK::MPIWrapper>&, int&, bool&, bool, bool&, int&, int&)'
                 m_distGradAgg = new AllReduceDistGradAggregator<ElemType>(m_mpi, m_numGradientBits, m_zeroThresholdFor1Bit, true /*useQuantizationForSelfStripe*/, m_bufferedAsyncGradientAggregation, traceLevel, m_syncStatsTrace);
                               ^
    Source/SGDLib/SGD.cpp:1846:27: note: candidate is:
    In file included from Source/SGDLib/SGD.cpp:12:0:
    Source/1BitSGD/AllReduceDistGradAggregator.h:43:5: note: Microsoft::MSR::CNTK::AllReduceDistGradAggregator<ElemType>::AllReduceDistGradAggregator(Microsoft::MSR::CNTK::MPIWrapper*, int, bool, bool, bool, int, int) [with ElemType = float]
         AllReduceDistGradAggregator(MPIWrapper* mpi, int nBits, bool zeroThresholdFor1Bit, bool useQuantizationForSelfStripe, bool useAsyncAggregation, int traceLevel, int syncStatsTrace)
         ^
    Source/1BitSGD/AllReduceDistGradAggregator.h:43:5: note:   no known conversion for argument 1 from 'std::shared_ptr<Microsoft::MSR::CNTK::MPIWrapper>' to 'Microsoft::MSR::CNTK::MPIWrapper*'
    Source/SGDLib/SGD.cpp: In instantiation of 'void Microsoft::MSR::CNTK::SGD<ElemType>::InitDistGradAgg(int, int) [with ElemType = double]':
    Source/SGDLib/SGD.cpp:2284:16:   required from here
    Source/SGDLib/SGD.cpp:1846:27: error: no matching function for call to 'Microsoft::MSR::CNTK::AllReduceDistGradAggregator<double>::AllReduceDistGradAggregator(std::shared_ptr<Microsoft::MSR::CNTK::MPIWrapper>&, int&, bool&, bool, bool&, int&, int&)'
                 m_distGradAgg = new AllReduceDistGradAggregator<ElemType>(m_mpi, m_numGradientBits, m_zeroThresholdFor1Bit, true /*useQuantizationForSelfStripe*/, m_bufferedAsyncGradientAggregation, traceLevel, m_syncStatsTrace);
                               ^
    Source/SGDLib/SGD.cpp:1846:27: note: candidate is:
    In file included from Source/SGDLib/SGD.cpp:12:0:
    Source/1BitSGD/AllReduceDistGradAggregator.h:43:5: note: Microsoft::MSR::CNTK::AllReduceDistGradAggregator<ElemType>::AllReduceDistGradAggregator(Microsoft::MSR::CNTK::MPIWrapper*, int, bool, bool, bool, int, int) [with ElemType = double]
         AllReduceDistGradAggregator(MPIWrapper* mpi, int nBits, bool zeroThresholdFor1Bit, bool useQuantizationForSelfStripe, bool useAsyncAggregation, int traceLevel, int syncStatsTrace)
         ^
    Source/1BitSGD/AllReduceDistGradAggregator.h:43:5: note:   no known conversion for argument 1 from 'std::shared_ptr<Microsoft::MSR::CNTK::MPIWrapper>' to 'Microsoft::MSR::CNTK::MPIWrapper*'
    In file included from Source/SGDLib/SimpleEvaluator.h:16:0,
                     from Source/SGDLib/SGD.h:9,
                     from Source/SGDLib/SGD.cpp:6:
    Source/SGDLib/SimpleDistGradAggregator.h: In instantiation of 'void Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradientsImpl(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, bool) [with ElemType = double]':
    Source/SGDLib/SimpleDistGradAggregator.h:110:129:   required from 'Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradients(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, int) [with ElemType = double]::__lambda23'
    Source/SGDLib/SimpleDistGradAggregator.h:110:112:   required from 'struct Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradients(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, int) [with ElemType = double]::__lambda23'
    Source/SGDLib/SimpleDistGradAggregator.h:111:57:   required from 'bool Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradients(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, int) [with ElemType = double]'
    Source/SGDLib/SGD.cpp:2608:3:   required from here
    Source/SGDLib/SimpleDistGradAggregator.h:283:186: error: 'MPI_Iallreduce' was not declared in this scope
                 MPI_Iallreduce(MPI_IN_PLACE, reductionBuffer, gradients[i]->GetNumElements(), MPIWrapper::GetDataType(reductionBuffer), MPI_SUM, m_mpi->Communicator(), &allReduceRequests[i]) || MpiFail("MPI_Iallreduce");
                                                                                                                                                                                              ^
    Source/SGDLib/SimpleDistGradAggregator.h: In instantiation of 'void Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradientsImpl(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, bool) [with ElemType = float]':
    Source/SGDLib/SimpleDistGradAggregator.h:110:129:   required from 'Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradients(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, int) [with ElemType = float]::__lambda23'
    Source/SGDLib/SimpleDistGradAggregator.h:110:112:   required from 'struct Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradients(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, int) [with ElemType = float]::__lambda23'
    Source/SGDLib/SimpleDistGradAggregator.h:111:57:   required from 'bool Microsoft::MSR::CNTK::SimpleDistGradAggregator<ElemType>::AggregateGradients(const std::vector<Microsoft::MSR::CNTK::Matrix<ElemType>*>&, Microsoft::MSR::CNTK::DistGradHeader*, int) [with ElemType = float]'
    Source/SGDLib/SGD.cpp:2608:3:   required from here
    Source/SGDLib/SimpleDistGradAggregator.h:283:186: error: 'MPI_Iallreduce' was not declared in this scope
    make[1]: *** [/home/plu/git/CNTK/build/release/.build/Source/SGDLib/SGD.o] Error 1
    make[1]: Leaving directory `/home/plu/git/CNTK'
    make: *** [all] Error 2
    

    My $LD_LIBRARY_PATH variable looks like this:

    /usr/local/cuda/lib64:/usr/local/acml5.3.1/ifort64/lib/:/usr/local/acml5.3.1/ifort64_mp/lib/:/usr/local/mpi/lib/
    

    The path where MPI is, is the same than the one proposed here

    Thanks for any help you can provide.

    opened by jplu 27
  • BrainScript extension for Visual Studio Code

    BrainScript extension for Visual Studio Code

    Visual Studio Code is getting a huge momentum. It would be extremely nice if BrainScript had its own VS Code extension. BrainScript is amazing, however, it's still lacking support in the IDE that are actually used by the broader Microsoft dev community. An VS Code extension would significantly boost adoption of BrainScript, the software effort being modest.

    opened by vermorel 26
  • parallelTrain seems do not work

    parallelTrain seems do not work

    Hi, I use about 80 hours speech data to train a two hidden layer neural network with four gpus. There are 2048 nodes in each hidden layer. However, epochtime is almost two and a half hours. It takes too long time to finish an epoch. I wonder if I am doing something wrong. Here is my log file:

       SGD = [
        epochSize = 0
        minibatchSize = 1024
        learningRatesPerSample = 0.000078
        momentumPerMB = 0.9
        dropoutRate = 0.1
        maxEpochs = 50
        L2RegWeight = 0.00001
    
    ParallelTrain = [
    	parallelizationMethod = BlockMomentumSGD
    	distributedMBReading = true
    parallelizationStartEpoch = 1 
    	syncPerfStats = 5
    	BlockMomentumSGD=[
        syncPeriod = 120000
    blockMomentumAsTimeConstant = 1920000
        resetSGDMomentum = true
        useNesterovMomentum = true
        ]
       ]
    
      ]
    

    randomize is set to 1728000 in reader. The command line is: mpiexec -np 4 cntk configFile=config/cntk.config When I use about 10 hours data to train the same DNN, epochtime is only about 10 minutes by a single gpu. I have four kinds of features in the input layer, and two kinds of targets in the output layer. Does the feature reading and error calculating take so much time? Or am I doing something wrong during my training? GPU version is K40m. Thank you.

    opened by xqustc 26
  • parallel LSTM with unique  sequences

    parallel LSTM with unique sequences

    Hello, I’m trying to create two parallel LSTM networks connecting at the output through a multiplier. The sequences supplied to the input LSTMs have different lengths, so I create the input as features = InputVariable({ inputDim }, false, DataType::Float, L"features", {Axis::NewUniqueDynamicAxis(L"d")});

    В итоге получаю ошибку: About to throw exception 'PastValue/FutureValue Function 'PastValue: Output('ElementTimes178_Output_0', [128], [d]) -> U nknown': Input operand 'Output('ElementTimes178_Output_0', [128], [d])' with #dynamic axes != 2 (1 sequence axis and 1 b atch axis) is not supported.'

    std::pair<FunctionPtr, FunctionPtr> LSTMPComponentWithSelfStabilization(Variable input, const NDShape& outputShape, const NDShape& cellShape, const std::function<FunctionPtr(const Variable&)>& recurrenceHookH, const std::function<FunctionPtr(const Variable&)>& recurrenceHookC, const DeviceDescriptor& device) {

    **auto dh = PlaceholderVariable(outputShape, input.DynamicAxes());** //### ERROR HERE
    

    problem in input.DynamicAxes ()

    need to build the following: Seq1(length1) -> LSTM1 -> PoolAvarage1-> ................................................................................................MUL(PoolAvarage1,PoolAvarage2) -> Output -> Seq2(length2) -> LSTM2 -> PoolAvarage2->

    opened by Ivashar 24
  • SequenceClassification.py does not work out of box

    SequenceClassification.py does not work out of box

    I use Visual Studio 2022 to run the SequenceClassification.py. The Python version I am using is 3.6.8, the CNTK version is cntk-gpu-2.7. However, when I run the program these errors came up:

    About to throw exception 'Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open.'
    attempt: Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open., retrying 2-th time out of 5...
    
    About to throw exception 'Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open.'
    attempt: Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open., retrying 3-th time out of 5...
    
    About to throw exception 'Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open.'
    attempt: Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open., retrying 4-th time out of 5...
    
    About to throw exception 'Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open.'
    attempt: Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open., retrying 5-th time out of 5...
    
    About to throw exception 'Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open.'
    Input file 'F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt' is not open.
    

    The call stacks are

    [CALL STACK]
        > 00007FFEF35CD7DB (SymFromAddr() error: Attempt to access invalid address.)
        - 00007FFEF35CEB5D (SymFromAddr() error: Attempt to access invalid address.)
        - 00007FFEF35D5B28 (SymFromAddr() error: Attempt to access invalid address.)
        - 00007FFEF35CE055 (SymFromAddr() error: Attempt to access invalid address.)
        - CreateDeserializer
        - CreateCompositeDataReader (x3)
        - CNTK::  UniversalLearner
        - CNTK::  CreateCompositeMinibatchSource
        - PyInit__cntk_py
        - PyCFunction_FastCallDict
        - PyObject_GetAttr
        - PyEval_EvalFrameDefault
        - PyObject_Free
        - PyFunction_FastCallDict
    

    Stack trace:

    Stack trace:
     >  File "F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\LSTM_GloVe.py", line 17, in create_reader
     >    )), randomize=is_training,max_sweeps=INFINITELY_REPEAT if is_training else 1)
     >  File "F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\LSTM_GloVe.py", line 52, in train_sequence_classifier
     >    reader = create_reader(path, True, input_dim, num_output_classes)
     >  File "F:\purdue\2022_Fall\ECE_62900\final_project\LSTM_GloVe\LSTM_GloVe.py", line 78, in <module> (Current frame)
     >    error, _ = train_sequence_classifier()
    Loaded 'cntk.io'
    Loaded '__main__'
    Loaded 'runpy'
    The program 'python.exe' has exited with code 0 (0x0).
    

    I have looked at issue #2999. The suggestion is: In the path in your code, add .txt at the end, e.g. '../../Tests/EndToEndTests/Image/Data\Train_cntk_text.txt.txt'. This didn't work for me.

    Are they any other suggestions?

    opened by HaomingYu1997 0
  • Error: could not find all specified 'to_nodes' in clone. Looking for ['relu5'], found [None]

    Error: could not find all specified 'to_nodes' in clone. Looking for ['relu5'], found [None]

    Hello.the script is error when I used VGG16_ImageNet_Caffe.model.so I print the clone_model , I found the base_model was 'outputs: Composite(data: Tensor[3,224,224]) -> Tensor[1000]'.Can you give me some advice

    opened by Thzny 0
  • API document issue due to syntax in source code

    API document issue due to syntax in source code

    In API document, notice that some of the parameters' names are not displayed properly.

    This is due to the single quotes used around "class" directives in source code: https://github.com/microsoft/CNTK/blob/10a8ffcf50d7b9225f3236ffcfdc422b2014fb92/bindings/python/cntk/contrib/crosstalkcaffe/adapter/bvlccaffe/caffeadapter.py#L67.

    We suggest to remove the quotes to resolve the API document issue.

    Related bug: https://dev.azure.com/ceapex/Engineering/_workitems/edit/724859

    opened by kexugit 0
  • A model causes CNTK crash with cudnnSetPoolingNDDescriptor when invoking the pooling operator

    A model causes CNTK crash with cudnnSetPoolingNDDescriptor when invoking the pooling operator

    Hi, we find a VGG model that can cause CNTK crash with cudnnSetPoolingNDDescriptor when invoking the pooling operator in the CNTK below 2.7.0 version. The model may have generated an incorrect parameter during pooling and lacked reasonable error hints. We provide the model file and replay script at this link https://drive.google.com/drive/folders/1RJ7vCO04I2HXgKdHAK0kOaY4OdDJ4Mt7?usp=sharing . We find this bug in CNTK-GPU version 2.7.0 with the keras version 2.2.4. The program execution stack in the crash is as follows: image

    opened by rubbberrabbit 0
  • Bump Newtonsoft.Json from 6.0.8 to 13.0.1 in /Examples/Evaluation/CNTKAzureTutorial01/CNTKAzureTutorial01

    Bump Newtonsoft.Json from 6.0.8 to 13.0.1 in /Examples/Evaluation/CNTKAzureTutorial01/CNTKAzureTutorial01

    Bumps Newtonsoft.Json from 6.0.8 to 13.0.1.

    Release notes

    Sourced from Newtonsoft.Json's releases.

    13.0.1

    • New feature - Add JsonSelectSettings with configuration for a regex timeout
    • Change - Remove portable assemblies from NuGet package
    • Change - JsonReader and JsonSerializer MaxDepth defaults to 64
    • Fix - Fixed throwing missing member error on ignored fields
    • Fix - Fixed various nullable annotations
    • Fix - Fixed annotations not being copied when tokens are cloned
    • Fix - Fixed naming strategy not being used when deserializing dictionary enum keys
    • Fix - Fixed serializing nullable struct dictionaries
    • Fix - Fixed JsonWriter.WriteToken to allow null with string token
    • Fix - Fixed missing error when deserializing JToken with a contract type mismatch
    • Fix - Fixed JTokenWriter when writing comment to an object

    12.0.3

    • New feature - Added support for nullable reference types
    • New feature - Added KebabCaseNamingStrategy
    • Change - Package now uses embedded package icon
    • Fix - Fixed bug when merging JToken with itself
    • Fix - Fixed performance of calling ICustomTypeDescriptor.GetProperties
    • Fix - Fixed serializing Enumerable.Empty and empty arrays on .NET Core 3.0
    • Fix - Fixed deserializing some collection types with constructor
    • Fix - Fixed deserializing IImmutableSet to ImmutableHashSet instead of ImmutableSortedSet
    • Fix - Fixed deserializing IImmutableDictionary to ImmutableDictionary instead of ImmutableSortedDictionary
    • Fix - Fixed deserializing into constructors with more than 256 parameters
    • Fix - Fixed hang when deserializing JTokenReader with preceding comment
    • Fix - Fixed JSONPath scanning with nested indexer
    • Fix - Fixed deserializing incomplete JSON object to JObject
    • Fix - Fixed using StringEnumConverter with naming strategy and specified values

    12.0.2

    • New feature - Added MissingMemberHandling to JsonObjectAttribute and JsonObjectContract
    • New feature - Added constructor to JTokenReader to specify initial path
    • New feature - Added JsonProperty.IsRequiredSpecified
    • New feature - Added JsonContract.InternalConverter
    • Change - Moved embedded debug symbols in NuGet package to a symbol package on NuGet.org
    • Fix - Fixed deserializing nullable struct collections
    • Fix - Fixed memory link when serializing enums to named values
    • Fix - Fixed error when setting JsonLoadSettings.DuplicatePropertyNameHandling to Replace

    12.0.1

    • New feature - Added NuGet package signing
    • New feature - Added Authenticode assembly signing
    • New feature - Added SourceLink support
    • New feature - Added constructors to StringEnumConverter for setting AllowIntegerValue
    • New feature - Added JsonNameTable and JsonTextReader.PropertyNameTable
    • New feature - Added line information to JsonSerializationException
    • New feature - Added JObject.Property overload with a StringComparison
    • New feature - Added JsonMergeSettings.PropertyNameComparison
    • New feature - Added support for multiple Date constructors with JavaScriptDateTimeConverter
    • New feature - Added support for strict equals and strict not equals in JSON Path queries

    ... (truncated)

    Commits

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 0
Releases(v2.7)
Owner
Microsoft
Open source projects and samples from Microsoft
Microsoft
Official code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]

EagerMOT: 3D Multi-Object Tracking via Sensor Fusion Read our ICRA 2021 paper here. Check out the 3 minute video for the quick intro or the full prese

Aleksandr Kim 276 Dec 30, 2022
Pytorch implementation of YOLOX、PPYOLO、PPYOLOv2、FCOS an so on.

简体中文 | English miemiedetection 概述 miemiedetection是女装大佬咩酱基于YOLOX进行二次开发的个人检测库(使用的深度学习框架为pytorch),支持Windows、Linux系统,以女装大佬咩酱的名字命名。miemiedetection是一个不需要安装的

248 Jan 02, 2023
Pure python implementations of popular ML algorithms.

Minimal ML algorithms This repo includes minimal implementations of popular ML algorithms using pure python and numpy. The purpose of these notebooks

Alexis Gidiotis 3 Jan 10, 2022
ML model to classify between cats and dogs

Cats-and-dogs-classifier This is my first ML model which can classify between cats and dogs. Here the accuracy is around 75%, however , the accuracy c

Sharath V 4 Aug 20, 2021
VOGUE: Try-On by StyleGAN Interpolation Optimization

VOGUE is a StyleGAN interpolation optimization algorithm for photo-realistic try-on. Top: shirt try-on automatically synthesized by our method in two different examples.

Wei ZHANG 66 Dec 09, 2022
A trusty face recognition research platform developed by Tencent Youtu Lab

Introduction TFace: A trusty face recognition research platform developed by Tencent Youtu Lab. It provides a high-performance distributed training fr

Tencent 956 Jan 01, 2023
NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

4.8k Jan 07, 2023
Codebase for the Summary Loop paper at ACL2020

Summary Loop This repository contains the code for ACL2020 paper: The Summary Loop: Learning to Write Abstractive Summaries Without Examples. Training

Canny Lab @ The University of California, Berkeley 44 Nov 04, 2022
A foreign language learning aid using a neural network to predict probability of translating foreign words

Langy Langy is a reading-focused foreign language learning aid orientated towards young children. Reading is an activity that every child knows. It is

Shona Lowden 6 Nov 17, 2021
Create and implement a deep learning library from scratch.

In this project, we create and implement a deep learning library from scratch. Table of Contents Deep Leaning Library Table of Contents About The Proj

Rishabh Bali 22 Aug 23, 2022
⚖️🔁🔮🕵️‍♂️🦹🖼️ Code for *Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances* paper.

Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances This repository contains the code for Measuring the Co

Daniel Steinberg 0 Nov 06, 2022
A collection of papers about Transformer in the field of medical image analysis.

A collection of papers about Transformer in the field of medical image analysis.

Junyu Chen 377 Jan 05, 2023
AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation AniGAN: Style-Guided Generative Adversarial Networks for U

Bing Li 81 Dec 14, 2022
TimeSHAP explains Recurrent Neural Network predictions.

TimeSHAP TimeSHAP is a model-agnostic, recurrent explainer that builds upon KernelSHAP and extends it to the sequential domain. TimeSHAP computes even

Feedzai 90 Dec 18, 2022
[ACMMM 2021 Oral] Enhanced Invertible Encoding for Learned Image Compression

InvCompress Official Pytorch Implementation for "Enhanced Invertible Encoding for Learned Image Compression", ACMMM 2021 (Oral) Figure: Our framework

96 Nov 30, 2022
Bottom-up Human Pose Estimation

Introduction This is the official code of Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation. This paper has been accepted to CVPR2

108 Dec 01, 2022
Compute execution plan: A DAG representation of work that you want to get done. Individual nodes of the DAG could be simple python or shell tasks or complex deeply nested parallel branches or embedded DAGs themselves.

Hello from magnus Magnus provides four capabilities for data teams: Compute execution plan: A DAG representation of work that you want to get done. In

12 Feb 08, 2022
ICML 21 - Voice2Series: Reprogramming Acoustic Models for Time Series Classification

Voice2Series-Reprogramming Voice2Series: Reprogramming Acoustic Models for Time Series Classification International Conference on Machine Learning (IC

49 Jan 03, 2023
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Master status: Development status: Package information: TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assista

Epistasis Lab at UPenn 8.9k Dec 30, 2022
Bootstrapped Representation Learning on Graphs

Bootstrapped Representation Learning on Graphs This is the PyTorch implementation of BGRL Bootstrapped Representation Learning on Graphs The main scri

NerDS Lab :: Neural Data Science Lab 55 Jan 07, 2023