scipy.sparse.hstack（（[1]，[2]）） - >“ValueError：blocks must be 2-D”。为什么？

问题描述：

scipy.sparse.hstack((1, [2]))和scipy.sparse.hstack((1, [2]))很好，但不是scipy.sparse.hstack(([1], [2]))。为什么会这样？scipy.sparse.hstack（（[1]，[2]）） - >“ValueError：blocks must be 2-D”。为什么？

这里是发生了什么事我的系统上的跟踪：

C:\Anaconda>python 
Python 2.7.10 |Anaconda 2.3.0 (64-bit)| (default, May 28 2015, 16:44:52) [MSC v. 
1500 64 bit (AMD64)] on win32 
>>> import scipy.sparse 
>>> scipy.sparse.hstack((1, [2])) 
<1x2 sparse matrix of type '<type 'numpy.int32'>' 
     with 2 stored elements in COOrdinate format> 
>>> scipy.sparse.hstack((1, 2)) 
<1x2 sparse matrix of type '<type 'numpy.int32'>' 
     with 2 stored elements in COOrdinate format> 
>>> scipy.sparse.hstack(([1], [2])) 
Traceback (most recent call last): 
    File "<stdin>", line 1, in <module> 
    File "C:\Anaconda\lib\site-packages\scipy\sparse\construct.py", line 456, in h 
stack 
    return bmat([blocks], format=format, dtype=dtype) 
    File "C:\Anaconda\lib\site-packages\scipy\sparse\construct.py", line 539, in b 
mat 
    raise ValueError('blocks must be 2-D') 
ValueError: blocks must be 2-D 
>>> scipy.version.full_version 
'0.16.0' 
>>>

我想你的意思是'hstack（（[1]，2））'起作用。 – hpaulj

@hpaulj谢谢，修复！ –

我们有没有回答你的问题？ – rayryeng

答

编码细节：

def hstack(blocks ...): 
    return bmat([blocks], ...) 

def bmat(blocks, ...): 
    blocks = np.asarray(blocks, dtype='object') 
    if blocks.ndim != 2: 
     raise ValueError('blocks must be 2-D') 
    (continue)

所以想你的替代品（记住额外[]）：

In [392]: np.asarray([(1,2)],dtype=object) 
Out[392]: array([[1, 2]], dtype=object) 

In [393]: np.asarray([(1,[2])],dtype=object) 
Out[393]: array([[1, [2]]], dtype=object) 

In [394]: np.asarray([([1],[2])],dtype=object) 
Out[394]: 
array([[[1], 
     [2]]], dtype=object) 

In [395]: _.shape 
Out[395]: (1, 2, 1)

最后一种情况（你的问题的情况下）失败，因为结果是3D。

随着2点稀疏矩阵（预期输入）：

In [402]: np.asarray([[a,a]], dtype=object) 
Out[402]: 
array([[ <1x1 sparse matrix of type '<class 'numpy.int32'>' 
    with 1 stored elements in COOrdinate format>, 
     <1x1 sparse matrix of type '<class 'numpy.int32'>' 
    with 1 stored elements in COOrdinate format>]], dtype=object) 

In [403]: _.shape 
Out[403]: (1, 2)

hstack被趁着bmat格式的，通过转动矩阵的列表转换矩阵的嵌套（2d）的列表。 bmat旨在将稀疏矩阵的二维数组组合成一个更大的矩阵。跳过首先制作这些稀疏矩阵的步骤可能会或可能不会工作。代码和文档不作任何承诺。

答

在scipy.sparse.hstack((1, [2]))第一种情况下，数字1被解释为一个标量和2号解释为致密矩阵，所以当你将这两样东西结合在一起时，数据类型被强制使得它们都是标量，你可以将它与scipy.sparse.hstack正常结合。

这里的一些测试，以证明这是多值真：

In [31]: scipy.sparse.hstack((1,2,[3],[4])) 
Out[31]: 
<1x4 sparse matrix of type '<type 'numpy.int64'>' 
    with 4 stored elements in COOrdinate format> 

In [32]: scipy.sparse.hstack((1,2,[3],[4],5,6)) 
Out[32]: 
<1x6 sparse matrix of type '<type 'numpy.int64'>' 
    with 6 stored elements in COOrdinate format> 

In [33]: scipy.sparse.hstack((1,[2],[3],[4],5,[6],7)) 
Out[33]: 
<1x7 sparse matrix of type '<type 'numpy.int64'>'

正如你所看到的，如果你在hstack至少有一个标量目前，这似乎工作。

但是，当您尝试执行scipy.sparse.hstack(([1],[2]))的第二种情况时，它们不再是标量，它们都是稠密矩阵，并且不能使用纯粹稠密矩阵的scipy.sparse.hstack。

重现：

In [34]: scipy.sparse.hstack(([1],[2])) 
--------------------------------------------------------------------------- 
ValueError        Traceback (most recent call last) 
<ipython-input-45-cd79952b2e14> in <module>() 
----> 1 scipy.sparse.hstack(([1],[2])) 

/usr/local/lib/python2.7/site-packages/scipy/sparse/construct.pyc in hstack(blocks, format, dtype) 
    451 
    452  """ 
--> 453  return bmat([blocks], format=format, dtype=dtype) 
    454 
    455 

/usr/local/lib/python2.7/site-packages/scipy/sparse/construct.pyc in bmat(blocks, format, dtype) 
    531 
    532  if blocks.ndim != 2: 
--> 533   raise ValueError('blocks must be 2-D') 
    534 
    535  M,N = blocks.shape 

ValueError: blocks must be 2-D

看到这个职位更多的见解：Scipy error with sparse hstack

因此，如果你想用两个矩阵成功地利用这一点，你必须让他们稀疏的第一，然后将它们结合起来：

In [36]: A = scipy.sparse.coo_matrix([1]) 

In [37]: B = scipy.sparse.coo_matrix([2]) 

In [38]: C = scipy.sparse.hstack([A, B]) 

In [39]: C 
Out[39]: 
<1x2 sparse matrix of type '<type 'numpy.int64'>' 
    with 2 stored elements in COOrdinate format>

有趣的是，如果你试图做你的hstack，或numpy.hstack茂密的版本做了什么，那么它是完全可以接受的：

In [48]: import numpy as np 

In [49]: np.hstack((1, [2])) 
Out[49]: array([1, 2])

....东西弄脏了稀疏矩阵表示¯\_(ツ)_/¯。

scipy.sparse.hstack（（[1]，[2]）） - >“ValueError：blocks must be 2-D”。为什么？

相关推荐