intel,trusted,trusted-computing,mle
No. The SRTM is always your firmware. And tboot itself is not your DRTM either. The DRTM is the SINIT module. Tboot is responsible for preparing the late launch and after returning from the SINIT code functions as your MLE, thus extending your dynamic chain of trust....
c++,multithreading,stack-overflow,intel,tbb
It is blocking-style parallelism plus heavy use of stack for matrices which result in the stack overflow. So, each your task reserve some stack for its data and then calls spawn_root_and_wait_for_all which in turn executes another instance of the same task which recursively keeps growing the stack. Use continuation-style programming...
linux,intel,firmware,galileo,bsp
You have issues with compiling ACPI tables. Crucial error message here is next: ASL Input: /home/mark/Quark_EDKII_v1.1.0/Build/QuarkPlatform/DEBUG_GCC44/IA32/QuarkPlatformPkg/Acpi/AcpiTables/AcpiTables/OUTPUT/Dsdt/Platform.iiii - 1301 lines, 48788 bytes, 568 keywords Compilation complete. 15 Errors, 0 Warnings, 0 Remarks, 31 Optimizations Probably your ACPICA tools outdated. You need to figure out, which version of ACPI compiler (from ACPICA...
movq counterI, %r9 movq $0, %r9 #used for debugging only shold not be 0 The second instruction makes the first useless but given the remark I understand you will remove it. Better still, you can remove both if you would change every occurence of %R9 into %R11. movzbq (%r10, %r9,...
x86,cpu,x86-64,intel,hyperthreading
That picture is not entirely accurate, much less is duplicated. Almost everything is shared, except: the architectural state, obviously on SB and IB, the loop buffer In particular, the decoders are shared, and alternate between the threads. In principle you could have hardware threads "go down both paths" at a...
The vocabulary used in intrinsics comes straight from the name of the assembly instructions each of them is translated to. In particular, “packed” is used in opposition to “scalar”, where a scalar instruction operates on one value and a packed one operates on a vector of as many values fit...
First of all, please note that you're looking at unoptimized compiler output. Compiler output often ends up looking kind of stupid with optimizations turned off because the compiler literally translates every line of C into an equivalent run of assembly without bothering to even do the simplest, most obvious optimizations....
android,virtualization,intel,haxm
In this case you can download and use ARM images that do not need hardware virtualization instead of using x86 system images. Need to tell you that ARM images will be very slow compared to x86 images. You can download ARM image of respective APIs by selecting this option. ...
Modern Intel CPUs actually implement many instructions using so-called microcode. Microcode consists of code written in a simpler low-level instruction set used to implement high-level instructions (for example, a rep-prefixed instruction might be implemented as a microcoded loop). Because this effectively requires the CPU itself to "compile" your input instruction...
c,windows,assembly,intel,processor
NO you cannot do that on Windows without writing a kernel mode driver because you're running in long mode (the 64bit extension of protected mode). What you are asking to do is impossible in user mode From the linked blog post from Raymond Chen: Windows NT didn't have a lot...
A x64 native (AMD64 or Intel 64) processor is only mandated to support SSE and SSE2. SSE3 is supported by Intel Pentium 4 processors (“Prescott”), AMD Athlon 64 (“revision E”), AMD Phenom, and later processors. This means most, but not quite all, x64 capable CPUs should support SSE3. Supplemental SSE3...
The exact workings of branch predictors will vary between processors. But nearly all non-trivial branch predictors need a history of the branches in the program to function. This history is recorded in the branch history buffer. These come in multiple flavors. The two most commonly studied are: Local History -...
Pass the XED_MACHINE_MODE_LONG_64 flag instead of the XED_MACHINE_MODE_LEGACY_32 to decode the instruction using 64-bit instruction decoding rules.
memory,graphics,cuda,fortran,intel
Since you tagged this question with the CUDA tag, I'll offer a CUDA answer. Not sure if it really makes sense given your environment. I haven't tested this on IVF, but it works on gfortran and PGI fortran (linux). You can use the fortran iso_c_binding module available in many implementations...
Instead of: mov edx, TYPE array mov eax, [ebp + 8] mul edx add eax, 12 mov eax, [ebp + eax] write: mov edx, TYPE array mov eax, [ebp + 8] mul edx add eax, [ebp+12] mov eax, [eax] ...
c,opencl,gpgpu,intel,macbookpro
This appears to be a bug with Apple's OpenCL implementation. According to clGetDeviceInfo(..., CL_DEVICE_VERSION, ...), the Intel HD4000 supports OpenCL 1.2 under OS X 10.9. This means that it has to support double precision, since this is a core feature as of OpenCL 1.2. I've just tested an even...
You are experiencing an issue associated to Cross Origin Resource Sharing. I recommend that you include a reference to the xhr.js script in the head tag of your index.html file. This should resolve this issue. <script src="xhr.js"></script> ...
android,android-studio,virtualization,intel,haxm
There's also the NX, XD bit setting. bcdedit /set nx AlwaysOn Make sure you use the Haxm from the web site instead of using the one downloaded through Android Studio). https://software.intel.com/en-us/android/articles/intel-hardware-accelerated-execution-manager If you're using Avast, disable "Enable hardware-assisted virtualization" under: Settings > Troubleshooting. ...
To add two 128-bit numbers x and y to give z with SSE you can do it like this z = _mm_add_epi64(x,y); c = _mm_unpacklo_epi64(_mm_setzero_si128(), unsigned_lessthan(z,x)); z = _mm_sub_epi64(z,c); This is based on this link how-can-i-add-and-subtract-128-bit-integers-in-c-or-c. The function unsigned_lessthan is defined below. It's complicated without AMD XOP (actually a found...
android,android-studio,virtualization,intel,bios
- Cpu has to support ND/XD (execute disable) bit & Virtualization (check ark.intel.com) - Dowwnload the separate installer rather than install thru Android studio at https://software.intel.com/en-us/android/articles/intel-hardware-accelerated-execution-manager - Set vt in BIOS. - Turn off Hypervisor and ND bit on with these 2 commands: bcdedit /set hypervisorlaunchtype off bcdedit /set...
You're far away from a correct, let alone a good program. First, there's nothing in your arguments on the stack to say how many points there are. Second, you should code in a HLL and then translate, using the HLL for comments. Third, you need to learn more about addressing...
memory,assembly,reverse-engineering,intel,xor
The numbers displayed are all in hex and you have forgotten to use proper endianness. If the user input was ascii 1234 that means the memory contains the bytes 31 32 33 34. Since x86 is little endian, the operand 1234567 is byte sequence 67 45 23 01. Performing the...
android,intel,intel-xdk,construct
The error message is already telling you what the problem is: Your system is configured to redirect all traffic to your local machine to a proxy server. Proxy servers are servers that are used to forward network traffic. They are often used in corporate networks and the like to allow...
Yes. Branch addresses with immediate or register operands are prefixed with asterisks. From this page: Branch addressing using registers or memory operands must be prefixed by a '*'. ...
c++,multithreading,parallel-processing,intel,tbb
Instantiation of tbb:task_scheduler_init object is optional. TBB has lazy auto-initialization mechanism which constructs everything on the first call to TBB algorithms/scheduler. Auto-initialization is equal to construction of a global task_scheduler_init object just before your first call to TBB. Of course, if you need to override the default number of threads,...
assembly,concurrency,intel,memory-model
The naming is a bit awkward. The "forwarding" happens inside a core/logical processor, as follows. If you first do a STORE, it will go to the store buffer to be flushed to memory asynchronously. If you do a subsequent LOAD to the same location ON THE SAME PROCESSOR before the...
Have you looked at the rdtscp instruction? This is the read serialized version of rdtsc. For benchmarking I would recommend to read this whitepaper. It provides a couple of best practices for measuring clock ticks. Alex(Intel)...
CPUID.1:EBX[23:16] represents the maximum number of addressable IDs (initial APIC ID) that can be assigned to logical processors in a physical package. Source. So 16 has nothing to do with the actual number of your logical CPUs. On my machine CPUID.1:EBX[23:16] also returns 16, though it has 8 logical...
If that laptop doesn't run any other resource hug, if the disks are not super-slow (because they may be old, fragmented, etc.), and if that database is not very big on itself, and if no more than 1 or 2 sessions get connected to it, 8Gig and i5 should be...
c,struct,bit-manipulation,intel,cpu-registers
You have tried two ways to do the same thing, and I wouldn't trust either of them. First, bit fields. Don't use them! The ordering of bit fields is unreliable, the behaviour of anything other than unsigned int is unreliable, the distribution of bit fields across struct members is unreliable....
I install this update of Microsoft and solved above problems: Intel Corporation driver update for Intel(R) HD Graphics 3000 ...
x86,intel,protection,memory-segmentation,privilege
As a general rule when you access a code segment directly you can only do that for segments with the same privilege. This is what the Non Conforming Code Segments (NCCS) are used for. Intel made has the favor of introducing also Conforming Code Segment (CCS) that can be accessed...
android,android-emulator,intel,hyperthreading,haxm
After Doing lot of search on internet about this problem I Finally found out a solution which is working for me. It is a Hardware issue and I switched to windows 8 PRO version and then Everything worked just Fine. Have a look at this question it also says the...
You're kernel seems to be pretty long. I suspect you are TDR'ing (timing out) out and Linux (Beignet) handles this more silently than Windows. Hence, I have a couple ideas. Check dmesg for a TDR message. I haven't used Beignet or a Linux OpenCL implementation for that matter, but the...
c++,assembly,embedded,intel,int128
Update: Since the OP hasn't accepted an answer yet <hint><hint>, I've attached a bit more code. Using the libraries discussed above is probably a good idea. While you might only need a few functions today, eventually you may find that you need one more. Then one more after that. Until...
This was an interesting one. It's because getc is made threadsafe version at the point when the queue is instantiated and so the time increase is the grab-release cycle of the locks - I'm not sure why/how this occurs but that is the decisive point on the AMD OpenCL SDK...
visual-c++,driver,opencl,intel,amd-processor
The SDK gets you the include files (e.g, cl.h) and library (OpenCL.lib, which loads OpenCL.dll at runtime). It does NOT select which devices your program can access. Therefore using either SDK should get you access to both devices, as well as any OpenCL GPU devices in your system. Your program...
c,compiler-construction,codeblocks,intel
I guess your Visual Studio 2010 is simply to new to work with the Intel Compiler 10.1.014 - or in other words your Intel Compiler version is too old. VS 2010 was released 2 years after ICC 10.1.014. Even VS 2008 is not officially supported in your compiler version. I...
Please read John M's articles and code samples https://software.intel.com/en-us/articles/intel-digital-random-number-generator-drng-software-implementation-guide section 2.2 covers pseudo and 2.3 covers true
assembly,compare,nasm,intel,fpu
FPU math should never afect to CPU flags like carry and zero! So copy flags from FPU to CPU flag register after fcomp instruction and than check the carry and zero flags like: fld qword ptr [a] fcomp qword ptr [b] wait ;wait FPU fstsw ax ;copy FPU flags to...
memory,graphics,intel,computer-architecture,vga
it has been quite a few years I wrote something directly for VGA so take that in mind. The old legacy stuff (CGA/EGA,VGA) mapped all VRAM memory access to two segments only (2 x 64KByte) 1.graphic modes A000:0000 - A000:FFFF 2.text modes B800:0000 - B800:FFFF So these two 64KByte chunks...
user-defined-functions,vectorization,intel,icc
A traditional function can only be converted when the parameter to the function falls in one of the following catagories (This is a current limitation documented at https://software.intel.com/sites/products/documentation/doclib/iss/2013/compiler/cpp-lin/GUID-42986DEF-8710-453A-9DAC-2086EE55F1F5.htm under "Restrictions on using vector declaration"): (un)signed 8, 16, 32, or 64-bit integer 32 or 64-bit floating point 64 or 128-bit complex...
There are two major syntax forms for x86 assembly language - Intel and AT&T, and percent sign indicates AT&T one, which is historically used in unix universe. It's not only about naming. Operation arguments direction is reversed, addressing is a little different, and some other things....
gpu,intel,gpgpu,hardware-acceleration,xeon-phi
The Xeon Phi requires a large degree of both functional parallelism (different threads) and vector parallelism (SIMD). Since the cores are essentially enhanced Pentium processors, serial code runs slowly. This will change somewhat with the next generation as it'll use faster and more modern cores. The current Xeon Phi also...
android,cordova,intel,intel-xdk
You don't need to add the PERMISSION, that is a very rare requirement. In most cases, the plugins will provide the appropriate permissions required, that is why you are getting the duplicate message in the build log. Regarding your APK not working, did you select the appropriate Intel XDK plugin?...
Probably, the progress indicator that you looked at takes a while to notice the change in rate. Maybe the progress is calculated as an average from the total duration and the total number of files deleted so far. Some disks have hardware parallelism (mainly SSDs). They can provide more throughput...
c++,multithreading,parallel-processing,intel,tbb
I've modified the example from thread_bound_filter documentation page in order to make the first filter as thread-bound. It works fine and I think it is what you need: #include <iostream> #include "tbb/compat/thread" #include "tbb/pipeline.h" using namespace tbb; using namespace std; char InputString[] = "abcdefg\n"; class InputFilter: public thread_bound_filter { char*...
Multiple programs can use the same raw data stream - the first one to take control of the camera gets to control the settings but others can use the stream. Be sure to install the Depth Camera Manager (DCM) that acts as the single voice to the camera.
Use the volatile keyword when assigning your variable to let the compiler know not to optimize it. As far as I know, this is a C/C++ standard so it should work on any compiler. See the MSDN link for more info.
c,multithreading,caching,intel,preload
It could be possible in some conditions - check if your CPU supports "DCA" (Direct Cache Access), and if you can activate this feature. This might be useful: https://www.myricom.com/software/myri10ge/790-how-do-i-enable-intel-direct-cache-access-dca-with-the-linux-myri10ge-driver.html I don't think you really need this though, going over the entire array sequentially should be very efficient as it would...
It allocates space on the stack for local variables. First BP it set to the current value of SP, then SP is decremented to make room for the local variables used by the function. As you can see, later [ss:rbp+???] is used to access parts of memory of this reserved...
x86,cpu,x86-64,intel,cpu-architecture
yes. The question already contained the answer, as the comments explained. :P (just posting an answer to get this out of the unanswered questions list.) I will add that Sandybridge and later Intel CPUs, with their uop cache, can more often come close to sustaining 4 uops per cycle in...
You can see how much memory is used for the computation whether it is malloc or static assignment. Then check the size of the L1 cache. For L1 data cache on MIC, it is 32 KB. From this, calculate the size of the data so that it can fit in...
You forgot to initialise DS. Change mov ax,data mov es,ax to mov ax,data mov ds,ax mov es,ax Is the initialisation of ES really needed?...
Byte, word, quad, etc are called size modifiers. The w in movw is called a size suffix....
I got the same problem.I will tell you how to solve it. Do u use avast antivirus? If yes then you have to disable Hardware assisted virtualization in Settings->trobleshooting tab of avast. If no then download this software from the link below.It will tell you whether the VT-x is enable...
c++,gcc,intel,static-members,constexpr
Because the function f takes a reference argument, there has to be a definition of S::ce that a reference can point to at runtime; the compiler can't just replace the argument with a literal 42. So you have to add an out-of-class definition: const int S::ce; just like you would...
Linux dynamic linker supports a special symbol type called STT_GNU_IFUNC. Strcmp is likely implemented as an IFUNC. 'Regular' symbols in a dynamic library are nothing more but a mapping from a name to the address. IFUNCs are a bit more complex than that: the address isn't readily available, in order...
android,intel,android-avd,haxm
HAXM means Intel® Hardware Accelerated Execution Manager which you can find in the BIOS as virtualization, first of all make sure your CPU has this feature: Go to your computer's bios and see if your cpu supports virtualization, if it does then make sure it's enabled and restart saving changes....
android,push-notification,intel,appframework
Ive found the Solution. I have forgott to type the IP 0.0.0.0/0 into the Google API Console. And i added the Google API Key and Projectnumber by the Push Category on Build Window. Then i copy paste the code from http://docs.appmobi.com/index.php/push-messages/ into index_user_scripts.js (only number 2 and 3). Number 1...
A few problems there: You only reserve 27 bytes at exec but you need 28 so you overwrite the start of buffer. What you are seeing in gdb is part your exec block followed by the buffer. Notice you have 3 D placeholders left, but you will write 4 more...
android,windows-7,android-studio,intel,hyper-v
Check this link to know about your processor support Intel VT. http://ark.intel.com/Products/VirtualizationTechnology It might not supported maybe.. Or you can use Arm emulator. Add new emulator and select armeabi-v7a System image....
c++,visual-studio-2012,openmp,intel,gpgpu
To your first question: ICC supports only Intel MIC for offloading. So you cannot use OpenMP 4.0 with ICC to program your Nvidia GPU. My knowledge may be outdated but the only available OpenMP 4.0 compiler which targets Nvidia GPUs is the Cray compiler Cray sells with their clusters. Maybe...
Is it because of some kind of ARM emulation? Yes. It is called libhoudini. Quoting myself: libhoudini is a proprietary ARM translation layer for x86-powered Android devices. It allows an app that has NDK binaries for ARM, but not x86, to still run on x86 hardware, albeit not as...
gcc,g++,arm,cross-compiling,intel
A lot of optimization options exist in GCC. By default the compiler tries to make the compilation process as short as possible and produce an object code that makes debugging easy. However, gcc also provides several options for optimization. Four general levels of incremental optimizations for performance are available in...
This is related to DR 1467 which did not allow using the list-initialization syntax for copying aggregates. This was recently fixed for classes in GCC and I extended the fix to vectors in r209449. Gcc-4.10 compiles your code.
The very unsatisfying answer to your question is that the MOF files have a lot of issues and a bug with the SCS team has been filed so this is a known problem. I am not allowed to send you the edited mof files that may fix things for you...
opencv,intel,tbb,google-nativeclient
For now there is no official port of Intel TBB for NaCl, and the project team at Intel (which I work in) is unaware of any unofficial one either.
The key is that you say, "I do nothing in the for loop." (Please correct me if I'm mistaken.) Because of practical limits when the Xeon Phi was created, its cores are based upon a Pentium generation machine with various enhancements, such as dual issue, 4 threads per core, and...
For now, it would be best to uninstall both of these packages (intel-opencl-sdk and intel-opencl-runtime) and install beignet from the AUR. https://aur.archlinux.org/packages/beignet/ The package provides the same functionality and allows you to use the Intel GPU cores also. I can confirm that it coexists well with other OpenCL platforms such...
cpu,intel,cpu-architecture,flops,nehalem
Nehalem is capable of executing 4 DP or 8 SP FLOP/cycle. This is accomplished using SSE, which operates on packed floating point values, 2/register in DP and 4/register in SP. In order to achieve 4 DP FLOP/cycle or 8 SP FLOP/cycle the core has to execute 2 SSE instructions per...
c++,multithreading,gpu,intel,cilk
Yes, it is possible. But it's not straightforward. You can use CUDA, OpenCL or DirectCompute for what you want, however you need to write a parallel version of your function. Running a "sequential" function on a GPU is definitely not interesting. Parallelising the algorithm might be the hardest task, underlyings...
Using a variable that starts with an underscore followed by a capital letter is undefined behaviour. Don't do it. (I've seen _MT standing for "use multithreaded runtime")....
As far as I know you can only offload OpenMP code on Intel MIC/Xeon Phi. However in the (near ?) future OpenMP 4 should offer this kind of feature (see this post). So GPGPU on Intel HD graphics can only be done with OpenCL and Intel CILK for the moment...
streaming,intel,video-encoding,intel-media-sdk
The Media SDK provides APIs that can be executed on either core (SW implementation), or on the GPU/fixed-function-logic (HW-accelerated Implementation), depending on the system and its capabilities. "mfxIMPL impl" -> Use software, or hardware or best available implementation. We recommend using MFX_IMPL_HARDWARE, or MFX_IMPL_AUTO if you are unsure of the...
arguments,fortran,intel,complex-numbers,intrinsics
Fortran 2008 allows complex argument. Some compilers already allow this. If your does not (as, e.g., ifort 15.0), compute it using exp(). cosh(x) = ( exp(x) + exp(-x) ) / 2 or use the identity cosh(x+iy) = cosh(x) * cos(y) + i * sinh(x) * sin(y) ...
In my matrix multiplication code I only have to use the broadcast once per kernel code but if you really want to load four doubles in one instruction and then broadcast them to four registers you can do it like this #include <stdio.h> #include <immintrin.h> int main() { double in[]...
All this absolute value business is unnecessary and frankly confusing and it has the edge-case that Math.abs(Integer.MIN_VALUE) is negative (not that weird when you get down to it, but it doesn't look like this code expects it), you can calculate the carry in simpler ways . For example using the...
osx,format,osx-mavericks,intel
Firstly write this command in the terminal: diskutil list Then it will show a list of disks. Find the one which caused the problem and use this command for it: sudo diskutil unmountDisk /dev/disk4 Now it should be erased successfully if you get back to the Disk Utilities program....
floating-point,intel,floating-point-precision,floating-point-conversion
Problem was mxcsr register's DAZ bit. One machine A had it set so it was treating denormals as 0 . Where as on machine 2 DAZ was reset and hence it was normalizing it and treating denormals as different values , which resulted in comparison result as not equal. But...
python,interface,arduino,intel,galileo
I have very similar code working with my Galileo interfacing with Python, and I notice two differences that could be causing your errors: When I make the system call, I save it as a file, not a text file: system("python /media/realroot/Files/tweetcheck.py > /media/realroot/result"); Maybe saving it as a text file...
I've run into the same issue recently as well. We had some database issues that hadn't resolved themselves over the weekend, in turn causing a massive influx of 1074kb logfiles (my count was roughly 25,000) I'm doing something similar on a weekly basis, just incase any new information appears. I'd...
Let's look at a sequence of instructions: 1. nop #call-stack is aligned to 16 bytes (sp is multiple of 16) to start. 2. call function #pushes return address (4 bytes) onto stack. ---(called function) 3. push %ebp #pushes base-pointer (4 bytes) onto stack, which is now 8-byte aligned ---cannot call...
You have to install the drivers for your integrated onboard gpu. This can be done by booting up while using iGPU from bios settings, and your pc shall be able to load the drivers it needs on its own. For my Ivy bridge, the bios settings are these: Go...
linux-kernel,virtualization,intel
The hardware limitations mostly concern old devies. For example, you have the ZONE_DMA, which is from 0 - 16MB. This is e.g. needed for older ISA Devices, which are not capable of adressing above the 16MB limit. Then you have the ZONE_NORMAL, where most of the kernel operations take place...
Flash can always be used to put static assets for your application, including microblaze program, configuration options, image, sound, etc. Most importantly, the fpga bitstream must resides on flash. All current xilinx fpgas are SRAM based and loose their content at poweroff. You need a non-volatile flash to store the...
windows,driver,intel,device-driver,realsense
If you are also showing an integrated webcam (other than the 3 RealSense listings), try disabling that so the RS camera can be seen as the primary.